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Preface 


I wrote this book to make the principles of wireless communication more accessible. Wireless 
communication is the dominant means of Internet access for most people, and it has become the means 
by which our devices connect to the Internet and to each other. Despite the ubiquity of wireless, the 
principles of wireless communication have remained out of reach for many engineers. The main 
reason seems to be that the technical concepts of wireless communication are built on the foundations 
of digital communication. Unfortunately, digital communication is normally studied at the end of an 
undergraduate program in electrical engineering, leaving no room for a course on wireless 
communication. In addition, this puts wireless communication out of reach for students in related 
areas like computer science or aerospace engineering, where digital communication may not be 
offered. This book provides a means to learn wireless communication together with the fundamentals 
of digital communication. 


The premise of this book is that wireless communication can be learned with only a background in 
digital signal processing (DSP). The utility of a DSP approach stems from the following fact: 
wireless communication signals (at least ideally) are bandlimited. Thanks to Nyquist’s theorem, it is 
possible to represent bandlimited continuous-time signals from their samples in discrete time. As a 
result, discrete time can be used to represent the continuous-time transmitted and received signals ina 
wireless system. With this connection, channel impairments like multipath fading and noise can be 
written in terms of their discrete-time equivalents, creating a model for the received signal that is 
entirely in discrete time. In this way, a digital communication system can be viewed as a discrete- 
time system. 


Many classical signal processing functions have a role to play in this discrete-time equivalent of 
the digital communication system. Linear time-invariant systems, which are characterized by 
convolution with an impulse response, model multipath wireless channels. Deconvolution is used to 
equalize the effects of the channel. Upsampling, downsampling, and multirate identities find 
application in the efficient implementation of pulse shaping at the transmitter and matched filtering at 
the receiver. Fast Fourier transforms are the foundation of two important modulation/demodulation 
techniques: orthogonal frequency-division multiplexing and single-carrier frequency-domain 
equalization. Linear estimation and least squares become the basis of algorithms for channel 
estimation (estimating an unknown filter response) and equalization (finding a deconvolution filter). 
Algorithms for estimating the parameters of an unknown sinusoid in noise find application in carrier 
frequency offset estimation. In short, signal processing has always been a part of communication; 
leveraging this fact, digital communication can be learned based on connections to signal processing. 


I begin this book with an introduction to wireless communication and signal processing in Chapter 
1, providing some historical context. A highlight of the chapter is the discussion of different 
applications of wireless communication, including broadcast radio and television, cellular 
communication, local area networking, personal area networks, satellite systems, ad hoc networks, 
sensor networks, and even underwater communications. The review of applications gives context for 
subsequent examples and homework problems in the book, which often draw on developments in 
wireless local area networks, personal area networks, or cellular communication systems. 


In the next two chapters, I establish a fundamental background in digital communication and signal 
processing. I start with an overview of the typical block diagram of digital communication systems in 


Chapter 2, with an explanation of each block at the transmitter and the receiver. Important functions 
are described, including source coding, encryption, channel coding, and modulation, along with a 
discussion of the wireless channel. The remainder of the chapter focuses on a subset of these 
functions: modulation, demodulation, and the channel. To provide a proper mathematical background, 
I provide an extensive overview of important concepts from signal processing in Chapter 3, including 
deterministic and stochastic signals, passband and multirate signal processing, and linear estimation. 
This chapter gives tools that are used to describe the operations of the digital communication 
transmitter and receiver froma signal processing perspective. 


With the fundamentals at hand, I continue with a more thorough treatment of modulation and 
demodulation in Chapter 4. Instead of considering all possible modulation formats as would be done 
in a deep-dive treatment, I focus on strategies described by complex pulse-amplitude modulation. 
This is general enough to describe most waveforms used in commercial wireless systems. The 
demodulation procedure is derived assuming an additive white Gaussian noise channel, including 
pulse shaping, maximum likelihood detection, and the probability of symbol error. The key parts of 
the transmitter and the receiver are described using multirate signal processing concepts. This chapter 
forms the basis of a classical introduction to digital communication but with a signal processing flair. 


The full specialization of the receiver algorithms to the wireless environment comes in Chapter 5. 
Specific impairments are described, including symbol timing offset, frame timing offset, carrier 
frequency offset, and frequency-selective channels. Several methods for mitigation are also 
described, including algorithms for estimating unknown parameters based on least squares estimation 
and equalization strategies that leverage alternatively the time or frequency domain. I focus on 
algorithms for dealing with impairments 1n the simplest ways possible, to set the stage for more 
advanced algorithms that might be encountered in the future. The chapter concludes with a description 
of large- and small-scale channel models, and a discussion about how to characterize the time and 
frequency selectivity of a channel. The knowledge in this chapter is essential for the design and 
implementation of any wireless system. 


The final chapter, Chapter 6, provides a generalization of the concepts from Chapters 4 and 5 to 
systems that use multiple transmit and/or multiple receive antennas, generally referred to as MIMO 
communication. In this chapter, I define different MIMO modes of operation and explore them in 
further detail, including receiver diversity, transmitter diversity, and spatial multiplexing. In essence, 
most of the descriptions in prior channels generalize to the MIMO case with suitable vector and 
matrix notation, and additional complexity in the receiver algorithms. While most of the chapter 
focuses on the flat-fading channel model, some generalizations to frequency-selective channels are 
provided at the end. This chapter provides important connections between wireless fundamentals and 
the types of communication systems now widely used in commercial wireless systems. 


I developed this book as part of a course at The University of Texas at Austin (UT Austin) called 
the Wireless Communications Lab, which served both senior undergraduates and new graduate 
students. The lecture portion of the course materials was based on drafts of this book. The lab portion 
uses my laboratory manual Digital Wireless Communication: Physical Layer Exploration Lab 
Using the NI USRP. published by the National Technology and Science Press in 2012. This 
laboratory manual is bundled with a USRP hardware package available from National Instruments. In 
the lab, students implement quadrature amplitude modulation and demodulation. They must deal with 
a succession of more sophisticated impairments, including noise, multipath channels, symbol timing 
offset, frame timing offset, and carrier frequency offset. The lab provides a way to see the concepts in 


this book in action on real wireless signals. For self-study or courses without a laboratory 
component, I have included several computer problems that involve simulating key pieces of the 
communication link. To see the concepts in action, you can implement the described algorithms over 
an audio channel using a microphone and speaker. I have used this approach in the past to prototype a 
High Frequency Near Vertical Incidence Skywave wireless communication link, operating in amateur 
radio bands. 


This book may be used in several different ways, with or without a laboratory component. In a 
junior-level undergraduate course, I would cover Chapters 1 through 5 and spend some extra time at 
the beginning on the mathematical fundamentals. For a senior-level undergraduate course, I would 
cover the material from the entire book. For a graduate course, I would cover the entire book with an 
additional implementation or research project. At UT Austin, the course I teach for both 
undergraduates and graduates covers the entire book and the first several laboratory experiments in 
the aforementioned lab manual. The graduate students also perform a project. It is an intense but 
rewarding course. 


The textbook is highly recommended as independent study, as a comprehensive review of the 
fundamentals is provided in Chapters 2 and 3. I encourage careful review of the mathematical 
fundamentals before proceeding to the main content in Chapters 4 through 6. To digest the material, I 
suggest working out all of the example problems with pen and paper, to help visualize the key ideas. 
Attempting select homework problems is also recommended. I believe that this book will provide a 
good foundation for further studies in wireless communications. 

As wireless communication continues to integrate into our lives, there continues to be a constant 
evolution of wireless technologies. My hope 1s that this book offers a fresh perspective on wireless 
and acts as a launching point for further developments to come. 


Register your copy of /ntroduction to Wireless Digital Communication at informit.com 
for convenient access to downloads, updates, and corrections as they become available. 
To start the registration process, go to informit.com/register and log in or create an 
account. Enter the product ISBN (9780134431796) and click Submit. Once the process 
is complete, you will find any available bonus content under “Registered Products.” 
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Chapter 1. Introduction 


1.1 Introduction to Wireless Communication 


During the last 100 years, wireless communication has invaded every aspect of our lives. Wireless 
communication, though, has existed for much longer than the wire it is replacing. Speech 1s a 
prehistoric example of a wireless system, though it is predated by gestures such as beating on one’s 
chest to display authority (still common with gorillas). Sadly, the distance over which speech 1s 
effective is limited because of the constraints of human acoustic power and the natural reduction in 
power as a function of distance. Early attempts at engineering a wireless communication system 
include smoke signals, torch signals, signal flares, and drums. One of the more successful of these 
was the heliotrope, which used reflections from the sun in a small mirror to convey digital signals. 


The modern notion of wireless communication relies on the transmission and reception of 
electromagnetic waves. The concept was theorized by Maxwell and demonstrated in practice by 
Hertz in 1888 [151]. Others contributed to the early demonstration of wireless communication, 
including Lodge, Bose, and de Moura. 


The earliest examples of wireless communication used what is now known as digital 
communication. The term digital comes from digitus in Latin, which refers to a finger or toe. Digital 
communication is a form of communication that involves conveying information by selecting one 
symbol from a set at any given time. For example, by extending just one finger, a hand can convey one 
of five symbols. Extending two fingers at a time, a hand can convey one of 5 x 4 = 20 symbols. 
Repeating the hand gestures quickly allows multiple symbols to be sent in succession. This is the 
essence of digital communication. 


Digital communication using electromagnetic waves involves varying the parameters of 
continuous-time signals (or analog signals) to send a sequence of binary information, or bits. The 
most common kind of wireline communication system in the 1800s was the telegraph, which used 
Morse code to send digital messages consisting of letters, numbers, stops, and spaces across the 
country, and even the ocean, over wires. A wireless telegraph was patented by Marconi in 1896, 
which is generally accepted as the first wireless (electromagnetic) digital communication system. The 
first transatlantic wireless Morse code message was sent by Marconi in 1901 [48]. The history of 
wireless digital communication is as old as wireless itself. 


Though interest in wireless telegraphy continued, digital communication gave way to analog 
communication as the primary modulation method used in wireless applications until the 1980s. Using 
analog communication, the parameters of a waveform are varied continuously based on a continuous- 
time signal at the input. An early example of analog communication is the original telephone system, 
developed in the late 1870s [55], in which acoustic speech waves were converted via a microphone 
to electrical signals that could be amplified and propagated on a wire. Early examples of wireless 
analog communication, still 1n use today, include AM (amplitude modulation) and FM (frequency 
modulation) radio, and older broadcast TV (television). Analog communication has been widely used 
in wireless communication systems, but it is now being replaced by digital communication. 

The primary reasons that digital communication has now overtaken analog communication are the 
prevalence of digital data and advancements in semiconductor technologies. Digital data was not 
common before the development of computers and computer networks. Nowadays, everything stored 


ona computer or exchanged over the Internet is digital, including e-mail, voice calls, music 
streaming, videos, and Web browsing among others. Advances in integrated circuits have led to 
increasing numbers of transistors in a given amount of semiconductor area, which has increased the 
potential of digital signal processing. While not required for digital communication, leveraging 
digital signal processing allows for much better transmitter and receiver algorithms. In wireline 
telephony, digital communication circuits began to completely replace analog circuits in the network 
backbone in the 1960s, in part because of the noise resilience of digital signals when transmitted over 
long distances (repeaters are less sensitive to noise than amplifiers). Similar developments in 
wireless communication, however, did not start in earnest until the 1980s. The reason, it seems, is 
that it was only in the 1980s that integrated circuit technology had developed to the point where it 
could be considered for use in portable wireless devices. About the same time, the compact disc 
started replacing the tape and vinyl record. 


Digital communication is now a fundamental part of wireless communication. In fact, almost all 
current and next-generation wireless communication systems (actually all developing standards as 
well) make use of digital communication. Wherever there is currently a wire, there is a proposal to 
eliminate that wire via wireless. There are a multitude of commercial, military, and consumer 
applications of wireless digital communication. 


1.2 Wireless Systems 


This section reviews common applications of wireless communication and introduces key 
terminology that facilitates the discussion of wireless communication in practice. Several topics are 
addressed, including broadcast radio and broadcast television, cellular communication, wireless 
local area networks, personal area networks, satellite networks, ad hoc networks, sensor networks, 
and finally underwater communication. The key concepts and the connections to digital 
communication are highlighted along the way. 


1.2.1 Broadcast Radio 


Broadcasting music was one of the first applications of wireless communication. A typical broadcast 
radio or television architecture is illustrated in Figure 1.1. Until recently, radio was still analog, 
being transmitted in the usual AM and FM bands and taking advantage of technology developed in the 
1920s and 1940s, respectively [243]. AM radio, the process of radio broadcasting using amplitude 
modulation, was the dominant method of radio broadcasting during the first 80 years of the twentieth 
century. Because of its susceptibility to atmospheric and electrical interference, AM radio now is 
mainly used for talk radio and news programming. In the 1970s, radio broadcasting shifted to FM 
radio, which uses frequency modulation to provide high-fidelity sound, especially for music radio 
and public radio. 


Satellite 
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Radio tower 


Figure 1.1 Ina radio or television network, signals can be broadcast by a high radio/television 
tower on the ground or by a satellite in the sky. 


In the 1990s, there was a transition of broadcast radio from analog to digital technology. In 1995, 
the digital audio broadcasting (DAB) standard, also known as Eureka 147, was developed [333]. 
DAB is used in Europe and other parts of the world, coexisitng in some cases with traditional AM 
and FM emissions. It uses a digital modulation technique known as COFDM (coded orthogonal 
frequency-division multiplexing) to broadcast multiple digital radio streams [304]. COFDM 1s a 
particular form of OFDM, which is discussed extensively in this book. 


The United States uses a different digital method known as HD Radio (a trademarked name), which 
was approved by the FCC (Federal Communications Commission) 1n 2002 as the AM and FM digital 
broadcasting system to transmit digital audio and data along the existing analog radio signals [317, 
266]. HD Radio uses a proprietary transmission technique, which also uses OFDM but fits in the gaps 
between existing FM broadcast stations. HD Radio started rolling out in force in 2007 in the United 
States. Digital coding and modulation techniques permit compact-disc-quality stereo signals to be 
broadcast from either satellites or terrestrial stations. In addition to audio quality improvement, 
digital audio broadcasting can provide other advantages: additional data services, multiple audio 
sources, and on-demand audio services. Just like today’s analog AM and FM radio, HD Radio 
requires no subscription fee. HD Radio receivers are factory installed in most vehicles at present. 
Therefore, owners of new cars immediately have access to the HD Radio audio and data services 
offered [317]. 


1.2.2 Broadcast Television 


Broadcasting television, after radio, is the other famous application of wireless. Analog television 
broadcasting began in 1936 in England and France, and in 1939 in the United States [233]. Until 
recently, broadcast TV was still using one of several analog standards developed in the 1950s: 
NTSC, named after the National Television System Committee, in the United States, Canada, and 
some other countries; PAL (phase alternating line) in much of Europe and southern Asia; and SECAM 
(séquentiel couleur ad mémoire) in the former Soviet Union and parts of Africa. In addition to 
fundamental quality limitations, analog television systems, by their nature, are rigidly defined and 
constrained to a narrow range of performance that offers few choices. The move to digital television 
enabled a higher level of signal quality (high-definition pictures with high-quality surround-sound 
audio) as well as a wider range of services. 


In the 1990s, the DVB (digital video broadcasting) suite of standards was initiated for digital and 
high-definition digital television broadcasting [274, 275]. DVB standards are deployed throughout 
much of the world, except in the United States. Like DAB, DVB also uses an OFDM digital 
modulation technique. There are several different flavors of DVB specified for terrestrial, satellite, 
cable, and handheld applications [104]. 


The United States chose to follow a different approach for high-definition digital broadcasting that 
produces a digital signal with a similar spectrum to the analog NTSC signal. The ATSC (Advanced 
Television Systems Committee) digital standard employs 8-VSB (vestigial sideband) modulation and 
uses a special trellis-encoder (one of the few examples of trellis-coded modulation in wireless 
systems) [86, 276, 85]. ATSC systems require directional antennas to limit the amount of multipath, 
since equalization is relatively more difficult compared with the OFDM modulation used in the DVB 
standard. In 2009, after over a half-century of use, the analog NTSC systems were replaced by ATSC 
in the United States. 


1.2.3 Cellular Communication Networks 


Cellular communication uses networks of base stations to provide communication with mobile 
subscribers over a large geographic area. The term ce// is used to refer to the area covered bya 
single base station. The base stations are placed such that the cells overlap, to provide mobile users 
with coverage, as shown in Figure 1.2. Clusters of cells share a set of radio frequencies, which are 
reused geographically, to make the most use of limited radio spectrum. Cellular systems support 
handoff, where a connection is transferred from one base station to another as a mobile user moves. 
The base stations are networked together, typically with a wireline network, with several functional 
components to provide services such as roaming and billing. Cellular networks are typically 
connected to the public switched telephone network (the network used for making telephone calls) 
and the Internet. 
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Figure 1.2 The components of a typical cellular system. Each cell has a base station serving 
multiple mobiles/users/subscribers. A backhaul network connects the base stations together and 
permits functions like handoff. Frequencies are reused in clusters of cells. 


The first generation of cellular communication devices used analog communication, in particular 
FM modulation, for the wireless link between mobile users and the base stations. The technology for 
these systems was conceived in the 1960s and deployed in the late 1970s and early 1980s [243, 366, 
216]. The use of analog technology gave little security (it was possible to eavesdrop ona call with 
the right radio gear), and limited data rates were supported. Many similar, but not compatible, first- 
generation systems were deployed around the same time, including AMPS (Advanced Mobile Phone 
System) in the United States, NMT (Nordic Mobile Telephony) in Scandinavia, TACS (Total Access 
Communication System) in some countries in Europe, Radiocom 2000 in France, and RTMI (Radio 
Telefono Mobile Integrato) in Italy. Japan had several different analog standards. The plurality of 
standards deployed in different countries made international roaming difficult. 


The second and subsequent generations of cellular standards used digital communication. Second- 
generation systems were conceived in the 1980s and deployed in the 1990s. The most common 
standards were GSM (Global System for Mobile Communications) [123, 170], IS-95 (Interim 
Standard 1995, also known as TIA-EIJA-95) [139, 122], and the combination IS-54/IS-136 (known as 
Digital AMPS). GSM was developed in a collaboration among several companies in Europe as an 
ETSI (European Telecommunications Standards Institute) standard. It was adopted eventually 
throughout the world and became the first standard to facilitate global roaming. The IS-95 standard 
was developed by Qualcomm and used a new (at the time) multiple access strategy called CDMA 
(Code Division Multiple Access) [137]; therefore, IS-95 was also known as cdmaOne. IS-95 was 
deployed in the United States, South Korea, and several other countries. The [S-54/IS-136 standard 
was developed to provide a digital upgrade to the AMPS system and maintain a certain degree of 
backward compatibility. It was phased out in the 2000s in favor of GSM and third-generation 
technologies. The major enhancements of second-generation systems were the inclusion of digital 
technology, security, text messaging, and data services (especially in subsequent enhancements). 

The third generation (3G) of cellular standards, deployed in the 2000s, was standardized by 3GPP 
(3rd Generation Partnership Project) and 3GPP2 (3rd Generation Partnership Project 2). UMTS 
(Universal Mobile Telecommunications System) was specified by 3GPP as the 3G technology based 
on the GSM standard [193, 79]. It used a similar network infrastructure and a higher-capacity digital 


transmission technology. The evolution of cdmaOne led to CDMA2000, which was standardized by 
3GPP2 [364, 97]. Notably, both UMTS and CDMA2000 employ CDMA. The major advance of third- 
generation standards over the second generation was higher voice capacity (the ability to support 
more voice users), broadband Internet access, and high-speed data. 


The fourth generation of cellular standards was the object of much development, and much debate 
(even over the definition of “fourth generation’). In the end, two systems were officially designated 
as fourth-generation cellular systems. One was 3GPP LTE (Long Term Evolution) Advanced release 
10 and beyond [93, 253, 299, 16]. The other was WiMAX (Worldwide Interoperability for 
Microwave Access), a subset of the IEEE 802.16 m standard [194, 12, 98, 260]. Though WiMAX 
was deployed earlier, 3GPP LTE became the de facto 4G standard. A major departure from third- 
generation systems, fourth-generation systems were designed from the ground up to provide wireless 
Internet access ina large area. 3GPP LTE is an evolution of 3GPP that supports larger-bandwidth 
channels and a new physical layer based on OFDMA (orthogonal frequency-division multiple access) 
where subcarriers can be dynamically assigned to different users. OFDMA is a multiple-access 
version of OFDM (orthogonal frequency-division multiplexing), which is discussed in Chapter 5. 
3GPP LTE Advanced adds other new capabilities, including more support for MIMO (multiple input 
multiple output) communication enabled by multiple antennas at the base station and handset, and thus 
supports higher data rates. WiMAX is based on the IEEE 802.16 standard. Essentially, the WiMAX 
Forum (an industry consortium) is specifying a subset of functions for implementation, and 
appropriate certification and testing procedures will ensure interoperability. WiMAX also employs 
OFDMA, though note that earlier versions used a slightly different access technique based on OFDM. 
Fourth-generation systems make more use of multiple antennas via MIMO communication, which is 
discussed in Chapter 6. The fourth generation of cellular systems promises higher data rates than 
previous systems along with network enhancements such as simplified backhaul architectures. 


Research on the fifth generation of cellular standards has begun in 3GPP. At the time of the writing 
of this book, various technologies are being considered to further improve throughput and quality and 
reduce latency and costs. There 1s great interest in continuing to push MIMO communication to its 
limits [321]. Massive MIMO promises hundreds of antennas at the base station to support more users 
simultaneously [223, 185, 42], and full-dimensional MIMO uses horizontal and vertical beamforming 
to support more users [238]. Millimeter wave MIMO systems making use of spectrum above 30GHz 
are also being considered for the fifth generation of cellular systems [268, 262, 269]. Research on all 
of these topics is ongoing [45, 11]. 


1.2.4 Wireless Local Area Networks (WLANs) 


Wireless local area networks are a wireless counterpart to Ethernet networks, whose initial objective 
was to deliver data packets from one computer to another. A wireless local area network is 
illustrated in . All WLANS use digital communication. The original objective of WLANs 
was simply to implement a local area network; in current deployments WLANs are seen as a primary 
means for wireless Internet access. Compared with cellular networks that use expensive licensed 
spectrum, WLANs are implemented in unlicensed bands like the ISM (Industrial, Scientific, and 
Medical) and U-NII (Unlicensed National Information Infrastructure) radio bands in the United States. 
This means they can be installed by anyone with approved equipment but cannot provide guaranteed 
service. WLANs are philosophically different from cellular networks. While both may be used for 
wireless Internet access, WLANs are primarily an extension of a wired network and are not designed 
to provide seamless large-area coverage, like a cellular network, for example. Most WLANs 
implement only basic forms of handoff, if any handoff is implemented at all. 


Figure 1.3 A wireless local area network. Access points (APs) serve clients. Unlike in cellular 
systems, handoff is generally not supported. 


The most widely used WLAN standards are developed within the IEEE 802.11 working group 
279, |. IEEE 802 is a group that develops LAN and MAN (metropolitan area network) standards, 
focusing on the physical (PHY), media access control (MAC), and radio link protocol (link) layers, 
considered Layer 1 and Layer 2 in typical networking literature [81]. The IEEE 802.11 working 
group focuses on WLANs. The Wi-Fi Alliance is an organization for certifying IEEE 802.11 products 
to guarantee interoperability (often Wi-Fi is used interchangeably with IEEE 802.11, though they are 
not exactly the same). Different subgroups of IEEE 802.11 are associated with different letters, such 
as IEEE 802.11b, IEEE 802.11a, IEEE 802.11g, and IEEE 802.11n. 

The original IEEE 802.11 standard supported 0.5Mbps (megabit-per-second) data rates with a 
choice of two different physical layer access techniques, either frequency-hopping spread spectrum 
or direct-sequence spread spectrum in the 2.4GHz ISM band. IEEE 802.11b provides data rates of 
11 bps by using Complementary Code Keying modulation, extending the direct-sequence spread 
spectrum mode. IEEE 802.11la and IEEE 802.11g provide data rates of 54Mbps in the 5.8GHz and 
2.4GHz bands, respectively, using OFDM modulation, which is discussed in 

IEEE 802.11n 1s a high-throughput extension of IEEE 802.11 g and IEEE 802.11a that uses MIMO 
communication, combined with OFDM, to provide even higher data rates [360, ]. MIMO enables 


a new class of modulation techniques, some of which can be used to send multiple data streams in 
parallel, and others that provide higher reliability as described further in Chapter 6. More advanced 
high-throughput extensions of IEEE 802.11 were developed as IEEE 802.1lac and IEEE 802.1 1ad. 
Two letters are used since single letters have been exhausted through other extensions of the standard. 
IEEE 802.11lac focuses on sub-6GHz solutions [29], and IEEE 802.1lad focuses on higher-frequency, 
in particular the 60GHz millimeter wave unlicensed band, solutions [258, 268]. Compared with IEEE 
802.11n, IEEE 802.1lac supports more advanced MIMO capability (up to eight antennas) and 
multiuser MIMO communication, where the access point communicates with several users at the same 
time. IEEE 802.1 1ad is the first WLAN solution at millimeter wave, providing gigabit-per-second 
(Gbps) peak throughputs. The next generation of WLAN is currently in development under the name 
IEEE 802.1 lay; it will support multiuser operation, targeting 100Gbps data rates and an extended 
transmission distance of 300—500m. 


1.2.5 Personal Area Networks (PANs) 


Personal area networks (PANS) are digital networks intended for short-range connectivity, typically 
on the order of 10min all directions, especially for wire replacement. An example of a PAN is 
illustrated in Figure 1.4. One of the most appropriate applications of a WPAN (wireless PAN) is to 
connect devices in the user’s personal space, that 1s, the devices an individual carries on or near the 
person, such as keyboards, headphones, displays, audio/video players, tablets, or smartphones [353]. 
According to the standards, a PAN can be viewed as a “personal communication bubble” around a 
person. All PANs use digital communication. PANs have a major architectural difference from 
WLANs—they expect communication in an ad hoc fashion. This means that devices can set up ad hoc 
peer-to-peer networks without the aid of a central controller (or access point). PANs are also 
implemented in unlicensed spectrum. 
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Figure 1.4 A wireless personal area network formed on an office desk. A computer connects with 
all the other devices, namely, a monitor, a keyboard, a PDA, a scanner, and a printer, via wireless 
links. The typical distance between devices 1s 10m. 


Most PANs are developed within the IEEE 802.15 working group [308]. The Bluetooth standard, 
IEEE 802.15.1a and later extensions, is perhaps the most familiar protocol. It is most commonly used 
for wireless headset connections to cell phones, wireless keyboards, and wireless computer mice. 
Another PAN standard is IEEE 802.15.4, known as ZigBee, intended for low-power embedded 
applications like sensor networks, home monitoring and automation, and industry controls [227]. 
IEEE 802.15.3c was a high-data-rate extension of 802.15 to the millimeter wave unlicensed band 
(around 57GHz to 64GHz), which was not as successful as WirelessHD [356], which was developed 
by an industry consortium [268]. These systems provide high-bandwidth connections in excess of 
2Gbps for applications such as wireless HDMI (High-Definition Multimedia Interface) and wireless 
video display connections. The boundaries between WLAN and PAN are starting to blur, with IEEE 
802.1lad taking over many of the functions offered by 60GHz PAN. It is likely that such developments 
will continue with IEEE 802.1 lay. 


1.2.6 Satellite Systems 


Satellite systems use space-based transceivers at very high altitudes over the Earth’s surface to 
provide coverage over large geographic areas, as illustrated in Figure 1.5. They are an alternative to 
terrestrial communication networks, where the infrastructure equipment is located on the ground. The 
idea of telecommunication satellites originated from a paper by Arthur C. Clarke, a science fiction 
writer, in Wireless World magazine in 1945 [74]. That paper proposed the use of the orbital 
configuration of a constellation of three satellites in the geo-stationary Earth orbit (GEO) at 35,800km 
to provide intercontinental communication services. Other orbits, namely, LEO (low Earth orbit) 
between 500km and 1700km and MEO (medium Earth orbit) between 5000km and 10,000km and 
over 20,000km, are now employed as well [222]. The higher orbit provides more coverage, that is, 
fewer satellites, but at the cost of larger propagation delay and free space loss. Until the 1960s, 
though, satellites were not actually for telecommunications in practice, but for observation and 
probes. Project SCORE, launched in 1958, was the world’s first communications satellite, providing 
a successful test of a space communication relay system. Since that time, the number of launched 
communication satellites has increased: 150 satellites during 1960-1970, 450 satellites during 1970-— 
1980, 650 satellites during 1980-1990, and 750 satellites during 1990—2000 [221]. 


Figure 1.5 The components of a satellite system. A satellite located at a high altitude over the 
Earth’s surface acts as a repeater to help point-to-point and point-to-multipoint transmissions 
between VSATs on the ground. 


Satellites, in the context of telecommunications, act as repeaters to help both point-to-point and 
point-to-multipoint transmissions of signals. Traditionally, communication satellites provide a wide 
range of applications, including telephony, television broadcasting, radio broadcasting, and data 
communication services [94]. Compared to other systems, communication satellite systems stand out 
because of their broad coverage, especially their ability to provide services to geographically 
isolated regions or difficult terrains. For example, mobile satellite services would target land mobile 
users, maritime users [180], and aeronautical users [164]. 


Satellites provide both long-distance (especially intercontinental) point-to-point or trunk telephony 
services as well as mobile telephony services. In 1965, Intelsat launched the first commercial 
satellite, named Early Bird, to provide intercontinental fixed telephony services. Communication 
satellite systems are able to provide worldwide mobile telephone coverage, also via digital 
communication technology [286]. The first GEO satellite providing mobile services, Marisat, was 
launched into orbit in 1976. Other examples of systems include Iridium, Inmarsat, and Globalstar. 
Satellite phones are inherently more expensive because of the high cost of putting a satellite in orbit 
and their low capacity. Satellite phones are useful in remote areas and for sea-based communication; 
their use in populated areas has been eclipsed by cellular networks. 


Television accounts for about 75% of the satellite market for communication services [221]. Early 
satellite TV systems used analog modulation and required a large receiving dish antenna. In 1989, 
TDF 1 was launched as the first television direct broadcasting satellite. Now most satellite TV 
programming is delivered via direct broadcast satellites, which use digital communication 
technology. Some examples of current communications satellites used for TV broadcasting 
applications are Galaxy and EchoStar satellites in the United States, Astra and Eutelsat Hot Bird in 
Europe, INSAT in India, and JSAT satellites in Japan. 


A recent application of satellite broadcast is high-definition radio. In the last 20 years, satellite 
radio has taken off in many areas [355]. The initial applications of satellites in radio were to provide 
high-fidelity audio broadcast services to conventional AM or FM broadcast radio stations. Now they 
are widely used for transmitting audio signals directly to the users’ radio sets. In satellite radio 
systems like SiriusXM [88], based on Sirius and XM technology [247, 84], digital communication is 
used to multicast digital music to subscribers. Other information may also be bundled in the satellite 
radio transmissions such as traffic or weather information. 


A final application of satellites is for data communication. Satellite systems provide various data 
communication services, including broadcast, multicast, and point-to-point unidirectional or 
bidirectional data services [105]. Example services include messaging, paging, facsimile, data 
collection from sensor networks, and of course wireless Internet access. Unidirectional or broadcast 
data communication services are often provided by VSAT (very small aperture terminal) networks [4, 
66, 188], using GEO satellites. VSAT networks work well for centralized networks with a central 
host and a number of geographically dispersed systems. Typical examples are small and medium-size 
businesses with a central office and banking institutions with branches in different locations. VSAT 
networks are also used for wireless Internet access in rural areas. 


High-altitude platform (HAP) stations are a hybrid technology that combines the benefits of 
terrestrial and satellite communication systems. Examples of HAP are unmanned airships and 
manned/unmanned aircraft flying in the stratosphere just above the troposphere, at an altitude of about 
17km or higher [76, 18, 103]. HAP stations may fill the gap between satellite-based communication 
systems, which are expensive and put high demands on the subscriber units because of the large 
distance to the satellites, and the terrestrial transmitters, which suffer from limited coverage. They 
may also be an alternative to cellular systems for telephony and wireless Internet access in parts of 
the world that lack cellular infrastructure. 


1.2.7 Wireless Ad Hoc Networks 


Ad hoc networks are characterized by their lack of infrastructure. Whereas users in cellular networks 
normally communicate with fixed base stations, users in ad hoc networks communicate with each 
other; all users transmit, receive, and relay data. A fantastic use case for ad hoc networks is by 
emergency services (police, search and rescue). Disasters, such as Hurricane Katrina, the earthquake 
in Haiti, or the typhoon in the Philippines, destroy the cellular infrastructure. Collaboration of rescue 
crews, communication with loved ones, and coordination of aid delivery are drastically hindered by 
the devastation. A mobile ad hoc network can transform a smartphone into both a cell tower and a 
cell phone. In this way, data can be transmitted throughout the disaster area. Ad hoc networks are also 
important in the military where there is high mobility and an inability to rely on existing fixed 
infrastructure. The soldiers of the future will require reliable, easily deployable, decentralized high- 
speed wireless communication networks for high-quality video, imagery, voice, and position data to 
ensure an information advantage in combat. There are many practical applications of ad hoc 
networks. 


Ad hoc networking capability is a core part of most PANs. With Bluetooth, for example, devices 
self-organize into a piconet with one device acting as the master and the other devices slaved to that 
master. The master coordinates transmissions among the various devices. WLANs also support ad 
hoc capability for communication between devices, and also a more formal mesh capability in IEEE 
802.11s [61]. Cellular networks are starting to support device-to-device communication where 


devices can exchange data directly without going through the base station [89, 110]. This is nota 
completely self-organized ad hoc operation, though, because the devices may coordinate key network 
operations like device discovery through the base station. 


A recent application of mobile ad hoc networking is vehicular ad hoc networking (commonly 
known as VANETs in the literature) [329]. As illustrated in Figure 1.6, VANETs involve both vehicle- 
to-vehicle communication and vehicle-to-infrastructure communication and are a key ingredient in 
connected and automated vehicles. A difference between VANETs and other ad hoc networks is in the 
overlying applications. Safety is a primary application of VANETs. For example, the dedicated short- 
range communication protocol [41, 232, 177] allows vehicles to exchange messages with position 
and velocity information for applications such as forward collision warning. Next-generation 
connected vehicles will exchange even more information. For example, sharing perceptual data 
among neighboring vehicles can extend a vehicle’s perception range beyond its visual line of sight 
[69]. This data can be fused to create a bird’s eye view of neighboring traffic, which can assist both 
automated and human drivers in difficult driving tasks such as overtaking and lane changing [234]. 
VANETs, especially at millimeter wave, continue to be an active area of research [337]. 


Figure 1.6 VANETs consist of vehicles, each of which is capable of communicating with other 
vehicles or infrastructure within its radio range for a variety of purposes, including, for example, 
collision avoidance. 


1.2.8 Wireless Sensor Networks 


A wireless sensor network is a form of an ad hoc wireless network, where wirelessly connected 
sensors relay information to some selected nodes at appropriate times. Advances in wireless 
communication, signal processing, and electronics have enabled the development of low-cost, low- 
power, multifunctional sensor nodes that are small in size and capable of sensing, data processing, 
and communicating [7]. The most important factor in the design of wireless sensor networks is the 
short network lifetime due to finite-capacity batteries [5]. 


Energy networks provide another potential application of wireless communication in the form of 
sensor networks. The electric power grid is based on hundred-year-old technology where power is 
sent into the network and consumption is measured by electric meters, which are read infrequently. 
Sensors can be used to enable what is called a smart grid, which supports features like demand-based 
pricing and distributed power generation [58, 293]. Many aspects of the smart grid are enabled 
through wireless meters. Smart grids can be implemented with a host of different wireline or wireless 
technologies. There are many research challenges in smart grid technology, including control, 
learning, and system-level issues. 


RFID (radio frequency identification) is a special type of communication that is used in 
applications such as manufacturing, supply chain management, inventory control, personal asset 
tracking, and telemedicine [361, 183, 62]. An RFID system consists of RFID tags, which are given to 
products and objects for identification purposes, and RFID readers. Readers broadcast queries to 
tags in their radio range for information control, and tags reply with stored identification information, 
typically using energy from the broadcast query to power the RFID circuit and transmitter [64]. Since 
no active transmission is involved, the power consumption for communication is very low [5]. RFID 
may be used ina sensor network as both a sensor and a means of communication to detect, for 
example, if the RFID tag (or the object that is tagged) is physically present in a given location. RFID 
has been standardized by EPCglobal and the ISO (International Organization for Standardization). 
The battery-free design of the typical RFID tag makes its design different from that of conventional 
communication systems. 


1.2.9 Underwater Communication 


Underwater communication is another niche application of wireless communication. Some 
applications of underwater communication are illustrated in Figure 1.7. The major difference from 
other forms of communication discussed in this chapter is that underwater communication is most 
often conceived with acoustic propagation versus electromagnetic waves in radio frequency wireless 
systems. The high conductivity in seawater, induced by salinity, causes large attenuation in 
electromagnetic radiation methods, making electromagnetic waves incapable of propagating over 
long distances. Acoustic methods have their own limitations, mainly a very limited bandwidth. 
Generally speaking, acoustic methods are used for low-rate long-distance transmission, whereas 
electromagnetic methods may be used for high-rate short-range transmission [168]. 
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Figure 1.7 An underwater communication system. Submarines can communicate with fixed stations 
located at the sea bottom or with ships on the surface. 


Modern underwater communication systems use digital transmission [311, 315, 265]. Froma signal 
processing perspective, underwater communication requires the use of sophisticated adaptive 
receiver techniques [311]. The reason is that, relatively speaking, the underwater propagation channel 
changes and presents many multipaths. Most radio frequency wireless systems are designed with a 
kind of block invariance where time variation can be neglected in short processing intervals. 


This assumption may not be appropriate for underwater communication due to the rapid channel 
variations. The main applications of underwater communication are found in the military, for 
example, ship-to-ship, ship-to-shore, and ship-to-sub, though there are commercial applications in the 
petroleum industry, such as autonomous underwater vehicles. Communicating underwater is a growth 
industry for the United States Navy. Two-way underwater digital communication between submarines 
and the AUTEC (Atlantic Undersea Test and Evaluation Center) range-control station in the Bahamas 
has been successfully demonstrated [144]. Sensor networks are also applied underwater for 
oceanographic data collection, environment monitoring, explorations, and tactical surveillance [9]. 
Many of the concepts developed in this book can be applied to underwater communication systems, 
with some modifications to account for variability of the propagation channel. 


1.3 Signal Processing for Wireless Communication 


A signal is a function that describes how a physical or a nonphysical variable changes over time 
and/or space. Signals are usually acquired by sensors and transformed by a transducer into an 
appropriate form to be stored, processed, or transmitted. For example, a microphone contains a 
diaphragm to capture the audio signal and a transducer to convert that signal into a voltage. Ina 
wireless communication system, typical signals are the currents and the electromagnetic fields used to 
carry data from a transmitter to a receiver through a wireless channel. There are many other types of 
signals besides audio and communications signals: speech, image, video, medical signals like an 
electrocardiogram, or financial signals measuring, for example, the evolution of stock prices. Signal 
processing is a relatively new engineering discipline that studies how to manipulate signals to extract 
information or to change the characteristics of the signal with a given purpose. 


Though signal processing includes digital and analog techniques, DSP dominates most of the 
application scenarios. Therefore, an analog signal to be processed is discretized and quantized before 
manipulation. For example, the receiver ina wireless communication system has to apply some 
processing to the received signal to remove noise, cancel interference, or eliminate the distortion due 
to the propagation through the wireless channel; at the transmitter side, signal processing is used to 
generate the waveform to be transmitted and maximize the range or the amount of information per time 
unit that can be sent. The current trend is to perform all these operations in digital, placing an analog- 
to-digital converter (ADC) or a digital-to-analog converter (DAC) as close as possible to the receive 
or transmit antenna respectively. Figure 1.8 shows an example of a basic communication system using 
a signal processing approach, making use of analog and digital techniques. 
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Figure 1.8 Basic block diagram of a digital communication system making use of analog and 
digital signal processing 
Signal processing has many applications in other fields such as: 


¢ Speech and audio, for speaker recognition, text-to-speech conversion, speech recognition, 
speech or audio compression, noise cancellation, or room equalization. 


¢ Image and video, for image and video compression, noise reduction, image enhancement, 
features extraction, motion compensation, or tracking of objects. 


¢ Medicine, for monitoring and analysis of biosignals. 

* Genomics, for interpretation of genomic information. 

¢ Finance, to analyze financial variables mainly for prediction purposes. 
* Radar, for detecting targets and estimating their position and velocity. 


Signal processing is a discipline at the intersection of signal processing and applied mathematics. 
It did not emerge as an independent field of study until the mid-twentieth century [239]. By that time 
Norbert Wiener had proposed a random process model for the information source. He also invented 
the Wiener filter, which provides a statistical estimate of an unknown process from an observed noisy 
process. The landmark paper “A Mathematical Theory of Communication,” written by Claude 
Shannon in 1948 [302], established the foundations of communication theory by analyzing a basic 
digital communication system from a signal processing perspective, using Wiener’s idea to model 
information signals. The sampling theorem proposed by Harry Nyquist in 1928 and proved by 
Shannon in 1949 in his paper “Communication in the Presence of Noise” addressed the problem of 
sampling and reconstruction of continuous signals, a milestone in DSP. For subsequent years, 
however, analog signal processing continued to dominate signal processing applications, from radar 
signal processing to audio engineering [239]. The publication in 1965 by Cooley and Tukey of an 
algorithm for the fast implementation of the Fourier transform (now known as FFT) led to the 
explosion of DSP, making it possible to implement convolution much more efficiently. Speech coding 
for telephone transmission was at that time a very active signal processing area, which started to 
benefit from adaptive algorithms and contributed to the success of DSP. Since that time, DSP 
algorithms have continued to evolve, leading to better performance and the expansion of the range of 
applications that benefit from them. Wireless communication is not an exception; the incredible 
increase in performance and data rates experienced in recent years in many communication systems 
was made possible by the increased complexity of DSP techniques. 


A signal processing approach tackles problems from a system perspective, including models for 
the input and output signals at every block in the system. The different blocks represent the different 
processing stages, which can be realized with an analog device or a numerical algorithm 
implemented in a digital processor, as can be seen in Figure 1.8. There exists a trade-off between the 
complexity and the performance of the models used for the signals and the analog components of the 
system: more accurate models provide an excellent tool for the simulation and practical evaluation of 
the system, but they increase complexity and simulation time and make the theoretical analysis of the 
problems difficult. Statistical characterization of signals using random process theory and probability 
provides useful models for the signal carrying the information and also for the noise and the 
interfering signals that appear in a wireless communication system. 


Signal processing theory also provides mathematical tools to relate the different signals ina 
system, using concepts from calculus, linear algebra, and statistics. Chapter 3 reviews in detail the 
fundamental signal processing results that can be used in the design and analysis of wireless 
communication systems. Linear time-invariant systems are used extensively in wireless 
communication to model different devices in the system such as filters or equalizers. Many of the 
features of a communication system are better understood in the frequency domain, so Fourier 
analysis is also a basic tool for wireless engineers. Digital communication systems leverage multirate 


theory results as well, since multirate filters lead to efficient implementations of many of the 
operations usually performed in a digital transmitter or receiver. Finally, fundamental results in linear 
algebra are the basis for many signal processing algorithms used for different tasks at the receiver 
such as channel equalization. 


A digital signal processing approach to wireless communications, the so-called software-defined 
radio (SDR) concept, makes sense for many reasons, such as ease of reconfigurability (software 
download) or simultaneous reception of different channels and standards, as shown in Figure 1.9. 
Digitizing the communication signal at the output of the receive antenna may not be feasible, however, 
because of technical (a very high sampling frequency) or cost (too-high power consumption at the 
ADC) reasons. Therefore, a trade-off between analog signal processing and DSP is usually found in 
practical communication systems, which usually include an analog stage to downconvert the signal 
followed by a digital stage, as illustrated in Figure 1.9. Later chapters of this book provide several 
examples of functional block diagrams corresponding to current communication systems that make use 
of this approach. 
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Figure 1.9 A current digital communication receiver based on the SDR concept. It allows 
simultaneous reception of channels using different standards with a single piece of hardware for 
the receiver. 
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1.4 Contributions of This Book 


This book presents the fundamentals of wireless digital communication from a signal processing 
perspective. It makes three important contributions. First, it provides a foundation in the mathematical 
tools that are required for understanding wireless digital communication. Second, it presents the 
fundamentals of digital communication from a signal processing perspective, focusing on the most 
common modulations rather than the most general description of a communication system. Third, it 
describes specific receiver algorithms, including synchronization, carrier frequency offset estimation, 
channel estimation, and equalization. This book can be used in conjunction with the codeveloped 
laboratory course [147] or independently on its own. 


There are already a number of textbooks on related topics of wireless communication and digital 
communication. Most other textbooks on wireless communication are targeted toward graduate 
students in communications, building on the foundations of graduate courses in random processes and 
digital communication. Unfortunately, undergraduate students, graduate students in other areas, and 


practicing engineers may not have taken the typical graduate prerequisites for those textbooks. Other 
textbooks on digital communication are targeted toward one- or two-semester graduate courses, 
attempting to present digital communication in its most general form. This book, however, focuses on 
a subset of digital communication known as complex pulse-amplitude modulation, which is used in 
most commercial wireless systems. In addition, this book describes in detail important receiver 
signal processing algorithms, which are required to implement a wireless communication link. While 
most concepts are presented for a communication system with a single transmit and single receive 
antenna, they are extended at the end of the book to MIMO communication systems, which are now 
widely deployed in practice. 

For communications engineers, this book provides background on receiver algorithms like channel 
estimation and synchronization, which are often not explained in detail in other textbooks. It also 
provides an accessible introduction to the principles of MIMO communication. For signal processing 
engineers, this book explains how to view a communication link through a signal processing lens. In 
particular, input-output relationships are built upon principles from digital signal processing so that 
the entire system can be represented in terms of discrete-time signals. Critical background on 
communication system impairments and their models 1s provided, along with an approachable 
introduction to the principles of wireless channel modeling. For analog, mixed-signal, and circuit 
designers, this book provides an introduction into the mathematical principles of wireless digital 
communication. The formulations are simplified from what is found in other textbooks, yet what is 
presented is immediately practical and can be used to prototype a wireless communication link [147]. 


Coverage in this book is intentionally narrow. No attempt is made to present a framework that 
includes every possible kind of digital communication. The focus is specifically on complex pulse- 
amplitude modulated systems. Nor is any attempt made to provide optimum receiver signal 
processing algorithms for all different channel impairments. Rather the emphasis is on using simpler 
estimators like linear least squares, which work in practice. The foundations developed in this book 
are a great platform for further work in wireless communication. 


1.5 Outline of This Book 


This book is organized to allow students, researchers, and engineers to build a solid foundation in key 
physical-layer signal processing concepts. Each chapter begins with an introduction that previews the 
material in each section and ends with a summary of key points in bullet form. Examples and 
numerous homework problems are provided to help readers test their knowledge. 


This chapter serves as an introduction to wireless communication, providing a detailed overview 
of the myriad applications. It also provides some historical background on signal processing and 
makes the case for using signal processing to understand wireless communications. 


Chapter 2 provides an overview of digital communication. The review is built around a canonical 
block diagram for a digital communication system to provide context for developments in subsequent 
chapters. Then the components of that diagram are discussed in more detail. First the types of 
distortion introduced by the wireless channel are reviewed, including additive noise, interference, 
path loss, and multipath. The presence of the wireless channel introduces many challenges in the 
receiver signal processing. Then a brief overview is provided of source coding and decoding, with 
examples of lossless and lossy coding. Source coding compresses data, reducing the number of bits 
that need to be sent. Next, some background is provided on secret-key and public-key encryption, 
which is used to keep wireless links secure from eavesdroppers. Then channel coding and decoding 


are reviewed. Channel coding inserts structured redundancy that can be exploited by the decoder to 
correct errors. The chapter concludes with an introduction to modulation and demodulation, including 
baseband and passband concepts, along with a preview of the impact of different channel 
impairments. Subsequent chapters in the book focus on modulation and demodulation, correcting 
channel impairments, modeling the channel, and extending the exposition to multiple antennas. 


Chapter 3 provides a review of signal processing fundamentals, which are leveraged in subsequent 
parts of the book. It starts with an introduction to the relevant continuous-time and discrete-time 
signal notation, along with background on linear time-invariant systems, the impulse response, and 
convolution. Linear time-invariant systems are used to model multipath wireless channels. The 
chapter continues with a review of several important concepts related to probability and random 
processes, including stationarity, ergodicity, and Gaussian random processes. Next, some background 
is provided on the Fourier transform in both continuous and discrete time, as well as on signal power 
and bandwidth, as it is useful to view communication signals in both the time and frequency domains. 
The chapter continues with derivation of the complex baseband signal representation and complex 
baseband equivalent channel, both of which are used to abstract out the carrier frequency of a 
communication signal. It then provides a review of some multirate signal processing concepts, which 
can be developed for efficient digital implementation of pulse shaping. The chapter concludes with 
background on critical concepts from linear algebra, especially the least squares solution to linear 
equations. 


Chapter 4 introduces the main principles of complex pulse-amplitude modulation. The main 
features of the modulation are provided, including symbol mapping, constellations, and the modulated 
signal’s bandwidth. Then the most basic impairment of additive white Gaussian noise is introduced. 
To minimize the effects of additive noise, the optimal pulse-shaping design problem is formulated and 
solved by Nyquist pulse shapes. Assuming such pulse shapes are used, then the maximum likelihood 
symbol detector is derived and the probability of symbol error is analyzed. The topics in this chapter 
form a basic introduction to digital communication using pulse-amplitude modulation with perfect 
synchronization and only the most basic impairment of additive noise. 


Chapter 5 describes other impairments introduced in wireless communication. It starts with an 
overview of symbol synchronization and frame synchronization for flat-fading channels. This 
involves knowing when to sample and the location of the beginning of a frame of data. It then 
proceeds to present a linear time-invariant model for the effects of multiple propagation paths called 
frequency selectivity. Several mitigation strategies are described, including linear equalization. 
Because the distortion introduced by the frequency-selective channel varies over time, the chapter 
then describes approaches for channel estimation. The channel estimate is used to compute the 
coefficients of the equalizer. Alternative modulation strategies that facilitate equalization are then 
introduced: single-carrier frequency-domain equalization (SC-FDE) and OFDM. Specific channel 
estimation and carrier frequency offset correction algorithms are then developed for single-carrier 
and OFDM systems. Most of the algorithms in this chapter are developed by formulating a linear 
system and taking the least squares solution. The chapter concludes with an introduction to 
propagation and fading-channel models. These statistical models are widely used in the design and 
analysis of wireless systems. A review is provided of both large-scale models, capturing channel 
variations over hundreds of wavelengths, and small-scale models, incorporating variations over 
fractions of a wavelength. Ways to quantify frequency selectivity and time selectivity are introduced. 
The chapter concludes with a description of common small-scale fading-channel models for both flat 


and frequency-selective channels. 


Chapter 6 concludes the book with a concise introduction to MIMO communication. Essentially, 
the key concepts developed in this book are re-examined, assuming a plurality of transmit and/or 
receive antennas. Most of the development is built around flat-fading channels, with extensions to 
frequency selectivity through MIMO-OFDM provided at the end. The chapter starts with an 
introduction to the different configurations of multiple antennas in SIMO (single input multiple 
output), MISO (multiple input single output), and MIMO configurations. It then describes the basics of 
receiver diversity for SIMO systems, including antenna selection and maximum ratio combining, 
including their impact on the probability of vector symbol error. Next it explains some approaches for 
extracting diversity in MISO communication systems, including beamforming, limited feedback, and 
space-time coding. The chapter then introduces the important MIMO technique known as spatial 
multiplexing. Extensions to precoding, limited feedback, and channel estimation are also described. 
The chapter concludes with an overview of MIMO-OFDM, which combines MIMO spatial 
multiplexing with the ease of equalization in OFDM systems. Key ideas like equalization, precoding, 
channel estimation, and synchronization are revisited in this challenging setting of MIMO with 
frequency-selective channels. MIMO and MIMO-OFDM are used in many commercial wireless 
systems. 


The concepts developed in this book are ideally suited for practical implementation in software- 
defined radio. The author has developed a companion laboratory manual [147], which is sold as part 
of a package with the National Instruments Universal Software Radio Peripheral. That laboratory 
manual features seven experiments that cover the main topics in Chapter 4 and Chapter 5, along with 
a bonus experiment that explores the benefits of error control coding. Of course, the concepts can be 
demonstrated in practice in other ways, even using a speaker as a transmit antenna and a microphone 
as a receive antenna. Readers are encouraged to simulate algorithms when possible, along with 
working example and homework problems. 


1.6 Symbols and Common Definitions 


We use the notation in Table 1.1 and assign specific definitions to the variables in Table 1.2 
throughout this book. 


[Convolution operator 
| oa Bold lowercase is used to denote column vectors 
A | Bold uppercase is used to denote matrices 
a,d_| Non-bold letters are used to denote scalar values 
Tle] | Magnitude ofsealara 
Ya) | Vector norm ofa 
iAli 
TAT J Mattix transpose SSS 
[AP | Conjugate transpose 
AS [Conjugate 
AST | Matricinvere 
| AT Moore-Penrose pseudo-inverse 
a 


Sealan axilase of A in 1 row j‘* column 


Als 
Column consisting o rows k, Ls +1,...,m of matrix A 
@ 


Used to index a discrete-time signal 


Discrete-time scalar signal and value at n 


Discrete-time vector signal and value at n 
Discrete-time matrix signal and value at n 


Discrete-time vector signal in frequency domain at subcarrier n 
Discrete-time matrix signal in frequency domain at subcarrier n 


— = log;9 unless otherwise mentioned 
phase(-) | Principle phase of the argument 


Table 1.1 Generic Notation Used in This Book 


Signal energy 
[LE ___| Chane order 


L 
{h[é}}4_, Discrete-time ISI (intersymbol interference) channel impulse 
response with (ZL + 1) taps 
=D he 


version of s{n] 
N = 
when dealing with MIMO communication 


Number of training symbols 


tr 
Nt 
N, Number of receive antennas, assumed to be 1 except 
when dealing with MIMO communication 
Nz Number of MIMO data streams, usually equal to N; 
unless precoding is applied 


Symbol sampled received signal of dimension 
Nx 1 


Transmit symbol vector of dimension N, x 1 


Symbol sampled transmitted signal of dimension 
N, x 1, often related to s[n] through linear precoding 


2 


Means that z is a Gaussian random variable with 
mean m and variance 0 


Means that 2 is a circularly symmetric complex Gaus- 
sian random variable with complex mean m, total 
variance a”, the real and imaginary parts of x are 
independent, and the variance of the real and imagi- 
nary parts is 07/2 each 


Carrier frequency 
Speed of ight 


Table 1.2 Common Definitions Used in This Book 


1.7 Summary 


¢ Wireless communication has a large number of applications, which are different from each 
other in the propagation environment, transmission range, and underlying technologies. 


¢ Most major wireless communication systems use digital communication. Advantages of digital 
over analog include its suitability for use with digital data, robustness to noise, ability to more 
easily support multiple data rates and multiple users, and easier implementation of security. 


¢ Digital signal processing 1s well matched with digital communication. Digital signal processing 
makes use of high-quality reproducible digital components. It also leverages Moore’s law, 
which leads to more computation and reduced power consumption and cost. 


¢ This book presents the fundamentals of wireless digital communication as seen through a signal 
processing lens. It focuses on complex pulse-amplitude modulation and the most common 
challenges faced when implementing a wireless receiver: additive noise, frequency-selective 
channels, symbol synchronization, frame synchronization, and carrier frequency offset 
synchronization. 


Problems 


1. Wireless Devices/Networks in Practice This problem requires some research on the 
technical specifications of wireless networks or wireless devices. 


(a) Choose a current cell phone from three of these manufacturers: Nokia, Samsung, Apple, LG, 
Huawei, Sony, Blackberry, Motorola, or another of your choosing. Describe the wireless 
and cellular technologies and the frequency bands supported by each one. 

(b) Name at least three mobile service providers in your country. Which cellular technologies 
are currently supported by the networks? 

(c) Which of those three mobile service providers charge for data and what are the charges for 
a typical consumer plan (not business)? Why do you think some providers have stopped 
offering unlimited data plans? 

2. Wireless Device Comparison Fill in the following table for three cellular devices 
manufactured by the three companies in the table: 


'Manufacturer | Device #1| Device #2 | Device #3 
Device 
What wireless technologies are supported 
(i.e., Wi-Fi, cellular, etc.)? 


Which cellular standards are supported? 
} 


Which frequency bands are used for 
communication? 


What is the maximum (claimed) data 
rate supported by the device using 
cellular? 


What is the maximum (claimed) 
data rate supported by the device using 


Wi-Fi? 


What operating system is employed? 


3. Visible Light Communication (VLC) Do some research on VLC, an alternative to wireless 


communication using RF (radio frequency) signals. This is a topic that we do not cover much in 
the book, but the topics from the book can be used to understand the basic principles. Be sure to 
cite your sources in the answers below. Note: You should find some trustworthy sources that 

you can reference (e.g., there may be mistakes in the Wikipedia article or it may be incomplete). 


(a) Which part of the IEEE 802 LAN/MAN Standards Committee deals with VLC? 
(b) What is the concept of VLC? 

(c) What is the bandwidth of a typical VLC application? 

(d) Explain how VLC might be used for secure point-to-point communication. 

(e) Explain how VLC might be used for indoor location-based services. 

(f) Explain why VLC might be preferred on aircraft for multimedia delivery. 

(g) Explain how VLC could be used in intelligent transportation systems. 


4. Sensor Networking There are many kinds of wireless networks, such as wireless sensor 
networks, that have important applications in manufacturing. They are often classified as low- 
rate wireless personal area networks. This is a topic that we do not cover much in the book, but 
the topics from the book can be used to understand the basic principles. Be sure to cite your 
sources in the answers below. Note: You should find some trustworthy sources that you can 
reference (e.g., there may be mistakes in the Wikipedia article or it may be incomplete). 

(a) What is a wireless sensor network? 

(b) What is IEEE 802.15.4? 

(c) What is ZigBee? 

(d) How are IEEE 802.15.4 and ZigBee related? 

(e) What communication bands are supported by IEEE 802.15.4 in the United States? 


(f) What is the bandwidth of the communication channel specified by IEEE 802.15.4? Note: 
This is bandwidth in hertz, not the data rate. 


(g) What is the typical range of an IEEE 802.15.4 device? 
(h) How long should the battery last in an IEEE 802.15.4 device? 
(1) How are sensor networks being used to monitor highway bridges? 


5. Wireless and Intellectual Property The wireless industry has been plagued with lawsuits 
over intellectual property. Identify a recent case of interest and describe the parties and their 
positions. Then describe in at least half a page your opinion about the role of intellectual 
property in wireless communications. 


Chapter 2. An Overview of Digital Communication 


2.1 Introduction to Digital Communication 


Communication is the process of conveying information from a transmitter to a receiver, through a 
channel. The transmitter generates a signal containing the information to be sent. Upon transmission, 
the signal propagates to the receiver, incurring various types of distortion captured together in what is 
called the channel. Finally, the receiver observes the distorted signal and attempts to recover the 
transmitted information. The more side information the receiver has about the transmitted signal or the 
nature of the distortion introduced by the channel, the better the job it can do at recovering the 
unknown information. 


There are two important categories of communication: analog and digital. In analog 
communication, the source is a continuous-time signal, for example, a voltage signal corresponding to 
a person talking. In digital communication, the source 1s a digital sequence, usually a binary sequence 
of ones and zeros. A digital sequence may be obtained by sampling a continuous-time signal using an 
analog-to-digital converter, or it may be generated entirely in the digital domain using a 
microprocessor. Despite the differences in the type of information, both analog and digital 
communication systems send continuous-time signals. Analog and digital communication are widely 
used in commercial wireless systems, but digital communication is replacing analog communication 
in almost every new application. 


Digital communication is becoming the de facto type of communication for a number of reasons. 
One reason is that it is fundamentally suitable for transmitting digital data, such as data found on 
every computer, cellular phone, and tablet. But this is not the only reason. Compared to analog 
systems, digital communication systems offer higher quality, increased security, better robustness to 
noise, reductions in power usage, and easy integration of different types of sources, for example, 
voice, text, and video. Because the majority of the components of a digital communication system are 
implemented digitally using integrated circuits, digital communication devices take advantage of the 
reductions in cost and size resulting from Moore’s law. In fact, the majority of the public switched 
telephone network, except perhaps the connection from the local exchange to the home, is digital. 
Digital communication systems are also easy to reconfigure, using the concept of software-defined 
radio, though most digital communication systems still perform a majority of the processing using 
application-specific integrated circuits (ASICs) and field programmable gate arrays (FPGAs). 


This chapter presents an overview of the main components in a single wireless digital 
communication link. The purpose is to provide a complete view of the key operations found in a 
typical wireless communication link. The chapter starts with a review of the canonical transmitter and 
receiver system block diagram in Section 2.2. A more detailed description of the input, the output, 
and the functions of each component 1s provided in the following sections. In Section 2.3, the 
impairments affecting the transmitted signal are described, including noise and intersymbol 
interference. Next, in Section 2.4, the principles of source coding and decoding are introduced, 
including lossless and lossy compression. In Section 2.5, encryption and decryption are reviewed, 
including secret-key and public-key encryption. Next, in Section 2.6, the key ideas of error control 
coding are introduced, including a brief description of block codes and convolutional codes. Finally, 
in Section 2.7, an overview of modulation and demodulation concepts is provided. Most of the 
remainder of this book will focus on modulation, demodulation, and dealing with impairments caused 


by the channel. 


2.2 Overview of a Wireless Digital Communication Link 


The focus of this book is on the point-to-point wireless digital communication link, as illustrated in 
Figure 2.1. It is divided into three parts: the transmitter, the channel, and the receiver. Each piece 
consists of multiple functional blocks. The transmitter processes a bit stream of data for transmission 
over a physical medium. The channel represents the physical medium, which adds noise and distorts 
the transmitted signal. The receiver attempts to extract the transmitted bit stream from the received 
signal. Note that the pieces do not necessarily correspond to physical distinctions between the 
transmitter, the propagation channel, and the receiver. In our mathematical modeling of the channel, 
for example, we include all the analog and front-end effects because they are included in the total 
distortion experienced by the transmitted signal. The remainder of this section summarizes the 
operation of each block. 
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Figure 2.1 The components of a typical digital communication system 


Now we describe the components of Figure 2.1 1n more detail based on the flow of information 
from the source to the sink. The first transmitter block is devoted to source encoding. The objective of 
source encoding is to transform the source sequence into an information sequence that uses as few bits 
per unit time as possible, while still enabling the source output to be retrieved from the encoded 
sequence of bits. For purposes of explanation, we assume a digital source that creates a binary 
sequence {b[n]} =..., b[-1], b[0], b[1], .. .. A digital source can be obtained from an analog 
source by sampling. The output of the source encoder is {i[n]}, the information sequence. Essentially, 
source encoding compresses the source as much as possible to reduce the number of bits that need to 
be sent. 


Following source encoding is encryption. The purpose of encryption is to scramble the data to 
make it difficult for an unintended receiver to interpret. Generally encryption involves applying a 
lossless transformation to the information sequence {i[n]} to produce an encrypted sequence e[n] = 


p({i[n]}). Unlike source coding, encryption does not compress the data; rather, it makes the data 
appear random to an uninformed receiver. 


The next block is the channel coder. Channel coding, also called error control coding or forward 
error correction, adds redundancy to the encrypted sequence {e[n]} to create {c[n]} in a controlled 
way, providing resilience to channel distortions and improving overall throughput. Using common 
coding notation, for every é input bits, or information bits, there is an additional redundancy of r bits. 
The total number of bits is m =k +r; the coding rate is defined as k/m. This redundancy may enable 
errors to be detected, and even corrected, at the receiver so that the receiver can either discard the 
data in error or request a retransmission. 


Following channel coding, the coded bits {c[m]} are mapped to signals by the modulator. This is 
the demarcation point where basic transmitter-side digital signal processing for communication ends. 
Typically the bits are first mapped in groups to symbols {s[n]}. Following the symbol mapping, the 
modulator converts the digital symbols into corresponding analog signals x(t) for transmission over 
the physical link. This can be accomplished by sending the digital signal through a DAC, filtering, and 
mixing onto a higher-frequency carrier. Symbols are sent at a rate of R, symbols per second, also 


known as the baud rate; the symbol period T, = 1/R, is the time difference between successive 
symbols. 


The signal generated by the transmitter travels to the receiver through a propagation medium, 
known as the propagation channel, which could be a radio wave through a wireless environment, a 
current through a telephone wire, or an optical signal through a fiber. The propagation channel 
includes electromagnetic propagation effects such as reflection, transmission, diffraction, and 
scattering. 


The first block at the receiver is the analog front end (AFE) which, at least, consists of filters to 
remove unwanted noise, oscillators for timing, and ADCs to convert the data into the digital regime. 
There may be additional analog components like analog gain control and automatic frequency control. 
This is the demarcation point for the beginning of the receiver-side digital signal processing for 
digital communication. 


The channel, as illustrated in Figure 2.1, is the component of the communication system that 
accounts for all the noise and distortions introduced by the analog processing blocks and the 
propagation medium. From a modeling perspective, we consider the channel as taking the transmitted 
signal x(t) and producing the distorted signal y(¢). In general, the channel is the part of a digital 
communication system that is determined by the environment and is not under the control of the system 
designer. 


The first digital communication block at the receiver is the demodulator. The demodulator uses a 
sampled version of the received signal, and perhaps knowledge of the channel, to infer the transmitted 
symbol. The process of demodulation may include equalization, sequence detection, or other 
advanced algorithms to help in combatting channel distortions. The demodulator may produce its best 
guess of the transmitted bit or symbol (known as hard detection) or may provide only a tentative 
decision (known as a soft decision). 


Following the demodulator is the decoder. Essentially the decoder uses the redundancy introduced 
by the channel coder to remove errors generated by the demodulation block as a result of noise and 
distortion in the channel. The decoder may work jointly with the demodulator to improve 
performance or may simply operate on the output of the demodulator, in the form of hard or soft 


decisions. Overall, the effect of the demodulator and the decoder is to produce the best possible 
guess @|] of the encrypted signal given the observations at the receiver. 


After demodulation and detection, decryption is applied to the output of the demodulator. The 
objective is to descramble the data to make it intelligible to the subsequent receiver blocks. 
Generally, decryption applies the inverse transformation p!(-) corresponding to the encryption 
process to produce the transmitted information sequence filn }} that corresponds to the encrypted 


sequence {e|n| }. 


The final block in the diagram is the source decoder, which reinflates the data back to the form in 
which it was sent: s[n] = g(il n|): This is essentially the inverse operation of the source encoder. 


After source decoding, the digital data is delivered to higher-level communication protocols that are 
beyond the scope of this book. 


The processes at the receivers are in the reverse direction to the corresponding processes at the 
transmitter. Specifically, each block at the receiver attempts to reproduce the input of the 
corresponding block at the transmitter. If such an attempt fails, we say that an error occurs. For 
reliable communication, it is desirable to reduce errors as much as possible (have a low bit error 
rate) while still sending as many bits as possible per unit of time (having a high information rate). Of 
course, this must be done while making sensible trade-offs in algorithm performance and complexity. 


This section provided a high-level picture of the operation of a typical wireless digital 
communication link. It should be considered as one embodiment of a wireless digital communication 
system. At the receiver, for example, there are often benefits to combining the operations. In the 
following discussion as well as throughout this book, we assume a point-to-point wireless digital 
communication system with one transmitter, one channel, and one receiver. Situations with multiple 
transmitters and receivers are practical, but the digital communication is substantially more complex. 
Fortunately, the understanding of the point-to-point wireless system is an outstanding foundation for 
understanding more complex systems. 


The next sections in this chapter provide more detail on the functional blocks in Figure 2.1. Since 
the blocks at the receiver mainly reverse the processing of the corresponding blocks at the transmitter, 
we first discuss the wireless channel and then the blocks in transmitter/receiver pairs. This treatment 
provides some concrete examples of the operation of each functional pair along with examples of 
their practical application. 


2.3 Wireless Channel 


The wireless channel is the component that mainly distinguishes a digital wireless communication 
system from other types of digital communication systems. In wired communication systems, the 
signal is guided from the transmitter to the receiver through a physical connection, such as a copper 
wire or fiber-optic cable. Ina wireless communication system, the transmitter generates a radio 
signal that propagates through a more open physical medium, such as walls, buildings, and trees ina 
cellular communication system. Because of the variability of the objects in the propagation 
environment, the wireless channel changes relatively quickly and 1s not under the control of wireless 
communication system designers. 


The wireless channel contains all the impairments affecting the signal transmission in a wireless 
communication system, namely, noise, path loss, shadowing, multipath fading, intersymbol 


interference, and external interference from other wireless communication systems. These 
impairments are introduced by the analog processing at both the transmitter and the receiver and by 
the propagation medium. In this section, we briefly introduce the concepts of these channel effects. 
More detailed discussions and mathematical models for the effects of the wireless channel are 
provided in subsequent chapters. 


2.3.1 Additive Noise 


Noise is the basic impairment present in all communication systems. Essentially, the term noise refers 
to a signal with many fluctuations and is often modeled using a random process. There are many 
sources of noise and several ways to model the impairment created by noise. In his landmark paper in 
1948, Shannon realized for the first time that noise limits the number of bits that can be reliably 
distinguished in data transmission, thus leading to a fundamental limitation on the performance of 
communication systems. 


The most common type of noise in a wireless communication system is additive noise. Two 
sources of additive noise, which are almost impossible to eliminate by filters at the AFE, are thermal 
noise and quantization noise. Thermal noise results from the material properties of the receivers, or 
more precisely the thermal motion of electrons in the dissipate components like resistors, wires, and 
So on. Quantization noise is caused by the conversion between analog and digital representations at 
the DAC and the ADC due to quantization of the signal into a finite number of levels determined by 
the resolution of the DAC and ADC. 


The noise is superimposed on the desired signal and tends to obscure or mask the signal. To 
illustrate, consider a simple system with only an additive noise impairment. Let x(t) denote the 
transmitted signal, y(t) the received signal, and v(f) a signal corresponding to the noise. In the 
presence of additive noise, the received signal is 

y(t) = x(t) + v(t). (2.1) 
For the purpose of designing a communication system, the noise signal v(t) is often modeled as a 
Gaussian random process, which is explained in more detail in Chapter 3. Knowledge of the 
distribution is important in the design of receiver signal processing operations like detection. 


An illustration of the effects of additive noise is provided in Figure 2.2. The performance of a 
system inflicted with additive noise is usually a function of the ratio between the signal power and the 
noise power, called the signal-to-noise ratio (SNR). A high SNR usually implies a good 
communication link, where exact determination of “high” depends on system-specific details. 
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Figure 2.2 (a) An example of a pulse-amplitude modulated communication signal; (b) a realization 

of a noise process; (c) a noisy received signal with a large SNR; (d) a noisy received signal with a 
small SNR 


2.3.2 Interference 


The available frequency spectrum for wireless communication systems is limited and hence scarce. 
Therefore, users often share the same bandwidth to increase the spectral efficiency, as in cellular 
communication systems discussed in Chapter 1. This sharing, however, causes co-channel 
interference between signals of different pairs of users, as illustrated in Figure 2.3. This means that 
besides its desired signal, a receiver also observes additional additive signals that were intended for 
other receivers. These signals may be modeled as additive noise or more explicitly modeled as 
interfering digital communication signals. In many systems, like cellular networks, external 
interference may be much stronger than noise. As a result, external interference becomes the limiting 
factor and the system is said to be interference limited, not noise limited. 
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Figure 2.3 Interference in the frequency domain: (a) co-channel interference; (b) adjacent channel 
interference 


There are many other sources of interference ina wireless communication system. Adjacent 
channel interference comes from wireless transmissions in adjacent frequency bands, which are not 
perfectly bandlimited. It is strongest when a transmitter using another band is proximate to a receiver. 
Other sources of interference result from nonlinearity in the AFE, for example, intermodulation 
products. The analog parts of the radio are normally designed to meet certain criteria, so that they do 
not become the performance-limiting factor unless at a very high SNR. This book focuses on additive 
thermal noise as the main impairment; other types of interference can be incorporated if interference 
is treated as additional noise. 


2.3.3 Path Loss 

During the course of propagation from the transmitter to the receiver in the physical medium, the 
signal power decays (typically exponentially) with the transmission distance. This distance- 
dependent power loss is called path loss. An example path loss function is illustrated in Figure 2.4. 


Path loss 
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Figure 2.4 Path loss as a function of distance, measured as the ratio of receive power to transmit 
power, assuming a free-space path-loss model with isotropic antennas. More details about this 
channel model are found in Chapter 5. 


There are a variety of models for path loss. In free space, path loss is proportional to the square of 
the frequency carrier, meaning that the higher-frequency carrier causes a higher path loss, assuming 
the receive antenna aperture decreases with antenna frequency. Path loss also depends on how well 
the transmitter can “see” the receiver. Specifically, the decay rate with distance is larger when the 
transmitter and receiver are obstructed in what is called non-line-of-sight (NLOS) propagation and is 
smaller when they experience line-of-sight (LOS) propagation. Several parameters like terrain, 
foliage, obstructions, and antenna heights are often taken into account in the computation of path loss. 
The path loss can even incorporate a random component, called shadowing, to account for variability 
of the received signal due to scattering objects in the environment. 


A simple signal processing model for path loss is to scale the received signal by ,/G where the 
square root is to remind us that path loss is usually treated as a power. Modifying (2.1), the received 
signal with path loss and additive noise is 


y(t) = VGa(t) + v(t). (2.2) 


Path loss reduces the amplitude of the received signal, leading to a lower SNR. 


2.3.4 Multipath Propagation 


Radio waves traverse the propagation medium using mechanisms like diffraction, reflection, and 
scattering. In addition to the power loss described in the previous section, these phenomena cause the 
signal to propagate to the receiver via multiple paths. When there are multiple paths a signal can take 
between the transmitter and the receiver, the propagation channel is said to support multipath 
propagation, which results in multipath fading. These paths have different delays (due to different 
path lengths) and different attenuations, resulting in a smearing of the transmitted signal (similar to the 
effect of blurring an image). 


There are different ways to model the effects of multipath propagation, depending on the bandwidth 
of the signal. In brief, if the bandwidth of the signal is small relative to a quantity called the 
coherence bandwidth, then multipath propagation creates an additional multiplicative impairment like 
the \/G in (2.2), but it is modeled by a random process and varies over time. If the bandwidth of the 
signal is large, then multipath is often modeled as a convolution between the transmitted signal and 
the impulse response of the multipath channel. These models are described and the choice between 
models is explored further 1n Chapter 5. 


To illustrate the effects of multipath, consider a propagation channel with two paths. Each path 
introduces an attenuation a and a time delay t. Modifying (2.2), the received signal with path loss, 
additive noise, and two multipaths is 


wt) = V/Ga,x(t — ™) + VGa2gx(t — T2) + v(t). (2.3) 


The path loss ,/G is the same for both paths because it is considered to be a locally averaged 
quantity; additional differences in received power are captured in differences in a, and a,. The 


smearing caused by multipath creates intersymbol interference, which must be mitigated at the 
receiver. 


Intersymbol interference (ISI) is a form of signal distortion resulting from multipath propagation as 
well as the distortion introduced by the analog filters. The name intersymbol interference means that 
the channel distortion is high enough that successively transmitted symbols interfere at the receiver; 
that 1s, a transmitted symbol arrives at the receiver during the next symbol period. Coping with this 
impairment is challenging when data is sent at high rates, since in these cases the symbol time is short 
and even a small delay can cause intersymbol interference. The effects of intersymbol interference are 
illustrated in Figure 2.5. The presence of ISI is usually analyzed using eye diagrams, as shown in this 
figure. The eye pattern overlaps many samples of the received signal. The effects of receiving several 
delayed and attenuated versions of the transmitted signal can be seen in the eye diagram as a 
reduction in the eye opening and loss of definition. Intersymbol interference can be mitigated by using 
equalization, as discussed in Chapter 5, or by using joint detection. 
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Figure 2.5 (a) Multipath propagation; (b) eye diagram corresponding to single-path propagation; 
(c) eye diagram corresponding to multipath propagation, showing ISI effects 


2.4 Source Coding and Decoding 


The main objective of a digital communication system 1s to transfer information. The source, 
categorized as either continuous or discrete, is the origin of this information. A continuous source is a 
signal with values taken from a continuum. Examples of continuous-valued sources include music, 
speech, and vinyl records. A discrete source has a finite number of possible values, usually a power 
of 2 and represented using binary. Examples of discrete sources include the 7-bit ASCII characters 
generated by a computer keyboard, English letters, and computer files. Both continuous and discrete 
sources may act as the origin of information in a digital communication system. 


All sources must be converted into digital form for transmission by a digital communication 
system. To save the resources required to store and transport the information, as few bits as possible 
should be generated per unit time, with some distortion possibly allowed in the reconstruction. The 
objective of source coding is to generate a compressed sequence with as little redundancy as 
possible. Source decoding, as a result, reconstructs the original source from the compressed 
sequence, either perfectly or in a reasonable approximation. 

Source coding includes both lossy and lossless compression. In lossy compression, some 
degradation is allowed to reduce the amount of resulting bits that need to be transmitted [35]. In 
lossless compression, redundancy is removed, but upon inverting the encoding algorithm, the signal is 
exactly the same [34, 345]. For discrete-time sources—for example, if fand g are the source 
encoding and decoding processes—then s[n] = g(i{n]) = g(f(s[n})). For lossy compression, 
s{n| an s(n} while for lossless compression, s[n] = 8[n}. Both lossy encoding and lossless encoding 
are used in conjunction with digital communication systems. 


While source coding and decoding are important topics in signal processing and information 
theory, they are not a core part of the physical-layer signal processing in a wireless digital 
communication system. The main reason is that source coding and decoding often happen at the 
application layer, for example, the compression of video in a Skype call, which is far removed from 
the physical-layer tasks like error control coding and modulation. Consequently, we provide a review 
of the principles of source coding only in this section. Specialized topics like joint source-channel 
coding [78] are beyond the scope of this exposition. In the remainder of this book, we assume that 
source coding has already been performed. Specifically, we assume that ideal source coding has been 
accomplished such that the output of the source encoder is an independent identically distributed 
sequence of binary data. 


2.4.1 Lossless Source Coding 


Lossless source coding, also known as zero-error source coding, 1s a means of encoding a source 
such that it can be perfectly recovered without any errors. Many lossless source codes have been 
developed. Each source code is composed of a number of codewords. The codewords may have the 
same number of bits—for example, the ASCII code for computer characters, where each character is 
encoded by 7 bits. The codewords may also have different numbers of bits in what is called a 
variable-length code. Morse code 1s a classic variable-length code in which letters are represented 
by sequences of dots and dashes of different lengths. Lossless coding is ideal when working with data 
files where even a 1-bit error can corrupt the file. 


The efficiency of a source-coding algorithm is related to a quantity called entropy, whichis a 
function of the source’s alphabet and probabilistic description. Shannon showed that this source 


entropy is exactly the minimum number of bits per source symbol required for source encoding such 
that the corresponding source decoding can retrieve uniquely the original source information from the 
encoded string of bits. For a discrete source, the entropy corresponds to the average length of the 
codewords in the best possible lossless data compression algorithm. Consider a source that can take 
values from an alphabet with m distinct elements, each with probability p,, p5, - - ° , p,,. Then the 


entropy is defined as 


mm 


= —>opi logs (pi), (2.4) 


i=1 
where in the special case of a binary source 
H = —p, logs(p1) — pz logs(p2). (2.5) 


The entropy of a discrete source is maximized when all the symbols are equally likely, that is, p; = 
1/m fori=1,2,...,m. Amore complicated entropy calculation is provided in Example 2.1. 


Example 2.1 
A discrete source generates an information sequence using the quaternary alphabet {a, 6, c, d} with 
probabilities P(s = a) = 1/8 P(s = b) = 1/8 P(s=c) = 1/2 ang P(s = d) = 1/8, 
The entropy of the source is 
1 | ee | oe | l ] 
5 = = preg | Gs Nees maken fea = f0ks = ra G5 - 2.6 
H(s) (5 logs 3 | 3 logs 3 4 n logs n + — logs ) (2.6) 
= 1.76. (2.7) 

This means that lossless compression algorithms for encoding the source need at least 1.75 bits per 
alphabet letter. 


There are several types of lossless source-coding algorithms. Two widely known in practice are 
Huffman codes and Lempel-Ziv (ZIP) codes. Huffman coding takes as its input typically a block of 
bits (often a byte). It then assigns a prefix-free variable-length codeword in such a way that the more 
likely blocks have smaller lengths. Huffman coding is used in fax transmissions. Lempel-Ziv coding 
is a type of arithmetic coding where the data is not segmented a priori into blocks but is encoded to 
create one codeword. Lempel-Ziv encoding is widely used in compression software like LZ77, Gzip, 
LZW, and UNIX compress. An illustration of a simple Huffman code is provided in Example 2.2. 


Example 2.2 


Consider the discrete source given in Example 2.1. The Huffman coding procedure requires the 
forming of a tree where the branch ends correspond to the letters in the alphabet. Each branch is 
assigned a weight equal to the probability of the letter at the branch. Two branches with the lowest 
weights form a new branch whose weight is equal to the sum of the weights of the two branches. The 
procedure repeats with the new set of branches until forming the root with the weight of one. Each 
branch node is now labeled with a binary | or 0 decision to distinguish the two branches forming the 
node. The codeword for each letter is found by tracing the tree path from the root to the branching end 
where the letter 1s located. The procedure 1s illustrated in Figure 2.6. 
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Figure 2.6 Huffman coding tree for the discrete source in Example 2.1 


The codewords for the letters in the alphabet are as follows: 111 for a, 110 for b, 0 for c, and 10 
for d. Notice that the letters with a higher probability correspond to the shorter codewords. 
Moreover, the average length of the codewords is 1.75 bits, exactly equal to the source entropy H(s). 


2.4.2 Lossy Source Coding 


Lossy source coding is a type of source coding where some information is lost during the encoding 
process, rendering perfect reconstruction of the source impossible. The purpose of lossy source 
coding is to reduce the average number of bits per unit time required to represent the source, at the 
expense of some distortion in the reconstruction. Lossy source coding is useful in cases where the end 
receiver of the information is human, as humans have some tolerance for distortion. 


One of the most common forms of lossy source coding found in digital communication systems is 
quantization. Since continuous-time signals cannot be transmitted directly in a digital communication 
system, it is common for the source coding to involve sampling and quantization to generate a 
discrete-time signal from a continuous-time source. Essentially, the additional steps transform the 
continuous-time source into a close-to-equivalent digital source, which is then possibly further 
encoded. Note that these steps are the sources of errors in lossy source coding. In general, it is 
impossible to reconstruct uniquely and exactly the original analog sources from the resulting discrete 
source. 


The sampling operation is based on the Nyquist sampling theorem, which is discussed in detail in 
Chapter 3. It says that no information is lost if a bandlimited continuous-time signal is periodically 
sampled with a small-enough period. The resulting discrete-time signal (with continuous-valued 
samples) can be used to perfectly reconstruct the bandlimited signal. Sampling, though, only converts 
the continuous-time signal to a discrete-time signal. 


Quantization of the amplitudes of the sampled signals limits the number of possible amplitude 
levels, thus leading to data compression. For example, a B bit quantizer would represent a 
continuous-valued sample with one of 28 possible levels. The levels are chosen to minimize some 
distortion function, for example, the mean squared error. An illustration of a 3-bit quantizer is 
provided in Example 2.3. 


Example 2.3 

Consider a 3-bit uniform quantizer. The dynamic range of the source, where most of the energy is 
contained, is assumed to be from —1 to 1. The interval [—1, 1] is then divided into 2* = 8 segments of 
the same length. Assuming the sampled signal is rounded to the closest end of the segment containing 
it, each sample is represented by the 3-bit codeword corresponding to the upper end of the segment 
containing the sample as illustrated in Figure 2.7. For example, a signal with a sample value of 0.44 
is quantized to 0.375, which is in turn encoded by the codeword 101. The difference between 0.44 
and 0.375 is the quantization error. Signals with values greater than 1 are mapped to the maximum 
value, in this case 0.875. This results in a distortion known as clipping. A similar phenomenon 
happens for signals with values less than —1. 
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Figure 2.7 An example of pulse-code modulation using 3-bit uniform quantization 


There are several types of lossy coding beyond quantization. In other applications of lossy coding, 
the encoder is applied in a binary sequence. For example, the Moving Picture Experts Group (MPEG) 
defines various compressed audio and video coding standards used for digital video recording, and 
the Joint Photographic Experts Group (JPEG and JPEG2000) describes lossy still image compression 
algorithms, widely used in digital cameras [190]. 


2.5 Encryption and Decryption 


Keeping the information transmitted by the source understandable by only the intended receivers is a 
typical requirement for a communication system. The openness of the wireless propagation medium, 
however, provides no physical boundaries to prevent unauthorized users from getting access to the 
transmitted messages. If the transmitted messages are not protected, unauthorized users are able not 
only to extract information from the messages (eavesdropping) but also to inject false information 
(spoofing). Eavesdropping and spoofing are just some implications of an insecure communication 
link. 

In this section, we focus on encryption, which is one means to provide security against 
eavesdropping. Using encryption, the bit stream after source encoding is further encoded (encrypted) 
such that it can be decoded (decrypted) only by an intended receiver. This approach allows sensitive 
information to travel over a public network without compromising confidentiality. 


There are several algorithms for encryption, and correspondingly for decryption. In this section, 
we focus on ciphers, which are widely used in communication. Ciphers work by transforming data 
using a known algorithm and a key or pair of keys. In secret-key encryption, an example of a 
symmetric cipher, a single secret key is used by both the encryption and the decryption. In public-key 
encryption, an example of an asymmetric cipher, a pair of keys is used: a public one for encryption 
and a private one for decryption. In a good encryption algorithm, it is difficult or ideally impossible 
to determine the secret or private key, even if the eavesdropper has access to the complete encryption 
algorithm and the public key. Informing the receiver about the secret or private key is a challenge in 
itself; it may be sent to the receiver by a separate and more secure means. Figure 2.8 illustrates a 


typical secret-key communication system [303]. 
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Figure 2.8 A typical secret-key communication system 


Now we describe a typical system. Let k, denote the key used for encryption at the transmitter and 
k, the key for decryption at the receiver. The mathematical model described in Section 2.2 for 
encryption should be revised as e[] = p(i[n], k,), where p(-) denotes the cipher. At the receiver, the 
decrypted data can be written as ip 1] = g(e[n], ko) where q(-) is the corresponding decipher (1.e., 
the decrypting algorithm). In the case of public-key encryption, k, and k, are different from each 
other; £, may be known to everyone, but k, must be known by only the intended receiver. With secret- 
key encryption, k, = k> and the key remains unknown to the public. Most wireless communication 


systems use secret-key encryption because it is less computationally intensive; therefore we focus on 
secret-key encryption for the remainder of this section. 


Secret-key ciphers can be divided into two groups: block ciphers and stream ciphers [297]. Block 
ciphers divide the input data into nonoverlapping blocks and use the key to encrypt the data block by 
block to produce blocks of encrypted data of the same length. Stream ciphers generate a stream of 
pseudo-random key bits, which are then bitwise XORed (1.e., the eXclusive OR [XOR] operation) 
with the input data. Since stream ciphers can process the data byte after byte, and even bit after bit, in 


general they are much faster than block ciphers and more suitable to transmission of continuous and 
time-sensitive data like voice in wireless communication systems [297]. Moreover, a single bit error 
at the input of the block decrypter can lead to an error in the decryption of other bits in the same 
block, which is called error propagation. Stream ciphers are widely used in wireless, including 
GSM, IEEE 802.11, and Bluetooth, though the usage of block ciphers 1s also increasing with their 
incorporation into the 3GPP third- and fourth-generation cellular standards. 


The major challenge in the design of stream ciphers is in the generation of the pseudo-random key 
bit sequences, or keystreams. An ideal keystream would be long, to foil brute-force attacks, but 
would still appear as random as possible. Many stream ciphers rely on linear feedback shift registers 
(LFSRs) to generate a fixed-length pseudo-random bit sequence as a keystream [297]. In principle, to 
create an LFSR, a feedback loop is added to a shift register of length 1 to compute a new term based 
on the previous n terms, as shown in Figure 2.9. The feedback loop is often represented as a 
polynomial of degree n. This ensures that the numerical operation in the feedback path is linear. 
Whenever a bit is needed, all the bits in the shift register are shifted right 1 bit and the LFSR outputs 
the least significant bit. In theory, an n-bit LFSR can generate a pseudo-random bit sequence of length 
up to (2” — 1) bits before repeating. LFSR-based keystreams are widely used in wireless 
communication, including the A5/1 algorithm in GSM [54], EO in Bluetooth [44], the RC4 and AES 
algorithms [297] used in Wi-Fi, and even in the block cipher KASUMI used in 3GPP third- and 
fourth-generation cellular standards [101]. 


Figure 2.9 Stream ciphering with a linear feedback shift register used to generate the keystream 


2.6 Channel Coding and Decoding 


Both source decoding and decryption require error-free input data. Unfortunately, noise and other 
channel impairments introduce errors into the demodulation process. One way of dealing with errors 
is through channel coding, also called forward error correction or error control coding. Channel 
coding refers to a class of techniques that aim to improve communication performance by adding 
redundancy in the transmitted signal to provide resilience to the effects of channel impairments. The 
basic idea of channel coding is to add some redundancy in a controlled manner to the information bit 
sequence. This redundancy can be used at the receiver for error detection or error correction. 


Detecting the presence of an error gives the receiver an opportunity to request a retransmission, 
typically performed as part of the higher-layer radio link protocol. It can also be used to inform 
higher layers that the inputs to the decryption or source decoding blocks had errors, and thus the 
outputs are also likely to be distorted. Error detection codes are widely used in conjunction with 


error correction codes, to indicate whether the decoded sequence is highly likely to be free of errors. 
The cyclic redundancy check (CRC) reviewed in Example 2.4 is the most widely used code for error 
detection. 


Example 2.4 


CRC codes operate on blocks of input data. Suppose that the input block length is & and that the output 
length is n. The length of the CRC, the number of parity bits, is n — k and the rate of the code is k/n. 
CRC operates by feeding the length £ binary sequence into an LFSR that has been initialized to the 
zero state. The LFSR is parameterized by n — k weights w,, w>,..., w,—;. The operation of the 
LFSR is shown in Figure 2.10. The multiplies and adds are performed in binary. The D notation 
refers to a memory cell. As the binary data is clocked into the LFSR, the data in the memory cells is 
clocked out. Once the entire block has been input into the LFSR, the bits remaining in the memory 
cells form the length n — k parity. At the receiver, the block of bits can be fed into a similar LFSR and 
the resulting parity can be compared with the received parity to see if an error was made. CRC codes 
are able to detect about 1-2”) errors. 
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Figure 2.10 The process of generating a CRC using an LFSR 


Error control codes allow the receiver to reconstruct an error-free bit sequence, if there are not too 
many errors. In general, a larger number of redundant bits allows the receiver to correct for more 
errors. Given a fixed amount of communication resources, though, more redundant bits means fewer 
resources are available for transmission of the desired information bits, which means a lower 
transmission rate. Therefore, there exists a trade-off between the reliability and the rate of data 
transmission, which depends on the available resources and the channel characteristics. This trade- 
off is illustrated through the simplest error control code, known as repetition coding, in Example 2.5. 
The example shows how a repetition code, which takes 1 bit and produces 3 output bits (a rate 1/3 
code), is capable of correcting one error with a simple majority vote decoder. The repetition code is 
just one illustration of an error control code. This section reviews other types of codes, with a more 
sophisticated mathematical structure, that are more efficient than the repetition code. 


Example 2.5 


Consider a rate 1/3 repetition code. The code operates according to the mapping in Table 2.1. For 
example, if the bit sequence to be sent is 010, then the encoded sequence is 000111000. 


0 000 
] 111 
Table 2.1 The Encoding Rule for a Rate 1/3 Repetition Code 


Suppose that the decoder uses the majority decoding rule for decoding. This leads to the decoder 
mapping in Table 2.2. Based on the decoding table, if there is only one bit error, then the decoder 
outputs the correctly sent bit. If there are two or three bit errors, then the decoder decides in favor of 
the incorrectly sent bit and an error is made. 


Input | Output 
000 0 
001 0 
010 | 0 
011 | 1 
100 | O 
101 | 1 
110 1 
ny | 


Table 2.2 Majority Decoding for a Rate 1/3 Repetition Code 


The performance of the code 1s often measured in terms of the probability of bit error. Suppose that 
a bit error occurs with probability p and is memoryless. The probability that 2 or 3 bits are in error 
out of a group of 3 bits is 1-(1—p)° + 3p(1-p). For p = 0.01, the uncoded system sees an error 
probability of 0.01, which means on average one in 100 bits is in error. The coded system, though, 
sees an error probability of 0.000298, which means that on average about 3 bits in 10,000 are in 
error. 


While repetition coding provides error protection, it is not necessarily the most efficient code. This 
raises an important issue, that is, how to quantify the efficiency of a channel code and how to design 
codes to achieve good performance. The efficiency of a channel code is usually characterized by its 
code rate k/n, where k is the number of information bits and 7 is the sum of the information and 
redundant bits. A large code rate means there are fewer redundant bits but generally leads to worse 
error performance. A code is not uniquely specified by its rate as there are many possible codes with 
the same rate that may possess different mathematical structures. Coding theory is broadly concerned 
with the design and analysis of error control codes. 


Coding is fundamentally related to information theory. In 1948, Shannon proposed the concept of 
channel capacity as the highest information rate that can be transmitted through a channel with an 


arbitrarily small probability of error, given that power and bandwidth are fixed [302]. The channel 
capacity can be expressed as a function of a channel’s characteristics and the available resources. Let 
C be the capacity of a channel and R the information rate. The beauty of the channel capacity result is 
that for any noisy channel there exist channel codes (and decoders) that make it possible to transmit 
data reliably with the rate R < C. In other words, the probability of error can be as small as desired 
as long as the rate of the code (captured by R) is smaller than the capacity of the channel (measured 
by C). Furthermore, if R > C;, there is no channel code that can achieve an arbitrarily small 
probability of error. The main message is that there is a bound on how large R can be (remember that 
since R = k/n, larger R means smaller n — 4), which is determined by a property of the channel given 
by C. Since Shannon’s paper, much effort has been spent on designing channel codes that can achieve 
the limits he set out. We review a few strategies for error control coding here. 


Linear block codes are perhaps the oldest form of error control coding, beyond simply repetition 
coding. They take a block of data and add redundancy to create a longer block. Block codes generally 
operate on binary data, performing the encoding either in the binary field or in a higher-order finite 
field. Codes that operate using a higher-order finite field have burst error correction capability since 
they treat groups of bits as an element of the field. Systematic block codes produce a codeword that 
contains the original input data plus extra parity information. Nonsystematic codes transform the input 
data into a new block. The well-known Hamming block code 1s illustrated in Example 2.6. 


Various block codes are used in wireless communication. Fire codes [108], cyclic binary codes 
capable of burst error correction, are used for signaling and control in GSM [91]. Reed-Solomon 
codes [272] operate in a higher-order finite field and are typically used for correcting multiple burst 
errors. They have been widely used in deep-space communication [114] and more recently in 
WiMAX [244] and IEEE 802.11ad [162]. 


Example 2.6 


In this example we introduce the Hamming (7,4) code and explain its operation. The Hamming (7,4) 
code encodes blocks of 4 information bits into blocks of 7 bits by adding 3 parity/redundant bits into 
each block [136]. The code is characterized by the code generator matrix 


100 0 
010 0 
00 1 0 
G= /0 0 O 1 (2.8) 
‘a 
1 @ 1, 1 
2 ao oe 


which takes an input vector of 4 bits e to produce an output vector of 7 bits c = Ge, where the 
multiplication and addition happen in the binary field. Note that it is conventional in coding to 
describe this operation using row vectors, but to be consistent within this book we explain it using 
column vectors. The code generator contains the 4 x 4 identity matrix, meaning that this is a 
systematic code. 


The parity check matrix for a linear block code is a matrix H such that H'G = 0. For the Hamming 
code 


1 1 0 
101 
Ga i 
H=|1 1 1}. (2.9) 
10 0 
010 
001 


Now suppose that the observed data is @ — ¢ + y where vis a binary error sequence (being all zero 
means there is no error). Then 


H'¢é = H'v. (2.10) 


If #7 TE is not zero, then the received signal does not pass the parity check. The remaining component 
is called the syndrome, based on which the channel decoder can look for the correctable error 
patterns. The Hamming code is able to correct 1-bit errors, and its rate of 4/7 is much more efficient 
than the rate 1/3 repetition code that performs the same task. 


Another major category of channel coding found in wireless communications is convolutional 
coding. A convolutional code is in general described by a triple (4, n, K), where k and n have the 


same meaning as in block codes and K is the constraint length defining the number of k-tuple stages 
stored in the encoding shift register, which is used to perform the convolution. In particular, the n- 
tuples generated by a (k, n, K) convolutional code are a function not only of the current input k-tuple 
but also of the previous (K — 1) input k-tuples. K 1s also the number of memory elements needed in the 
convolutional encoder and plays a significant role in the performance of the code and the resulting 
decoding complexity. An example of a convolutional encode is shown in Example 2.7. 


Convolutional codes can be applied to encode blocks of data. To accomplish this, the memory of 
the encoder is initialized to a known state. Often it is initialized to zero and the sequence is then zero 
padded (an extra K zeros are added to the block) so that it ends in the zero state. Alternatively, tail 
biting can be used where the code 1s initialized using the end of the block. The choice of initialization 
is explained in the decoding algorithm. 


Example 2.7 


In this example we illustrate the encoding operation for a 64-state rate 1/2 convolutional code used in 
IEEE 802.11a as shown in Figure 2.11. This encoder has a single input bit stream (A = 1) and two 


output bit streams (7 = 2). The shift register has six memory elements corresponding to 2° = 64 states. 
The connections of the feedback loops are defined by octal numbers as 133, and 171, corresponding 


to the binary representations 001011011 and 001111001. Two sets of output bits are created, 
c[n] = e[n] Be[n — 2] Ge[n — 3] Be[n — 5] Ge[n — 6] (2.11) 
ca[n] = e[n] Be[n — 1] Geln — 2] Beln — 3] Ge[n — 6], (2.12) 


Cc; [ 
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Figure 2.11 The 64-state rate 1/2 convolutional code used in IEEE 802.11a 


where ® is the XOR operation. The encoded bit sequence is generated by interleaving the outputs, that 
is, c[2n] =c,[n] and c[2n + 1] =c,[n]. 


Decoding a convolutional code is challenging. The optimum approach 1s to search for the sequence 
that is closest (in some sense) to the observed error sequence based on some metric, such as 
maximizing the likelihood. Exploiting the memory of convolutional codes, the decoding can be 
performed using a forward-backward recursion known as the Viterbi algorithm [348]. The complexity 
of the algorithm is a function of the number of states, meaning that the error correction capability that 
results from longer constraint lengths comes at a price of more complexity. 


Convolutional coding is performed typically in conjunction with bit interleaving. The reason is that 
convolutional codes generally have good random error correction capability but are not good with 
burst errors. They may also be combined with block codes that have good burst error correction 
capability, like the Reed-Solomon code, in what is known as concatenated coding. Decoding 


performance can be further improved by using soft information, derived from the demodulation block. 
When soft information is used, the output of the demodulator ¢jn]| 1s a number that represents the 
likelihood that ¢jn| is a zero or a one, as opposed to a hard decision that is just a binary digit. Most 
modern wireless systems use bit interleaving and soft decoding. 


Convolutional codes with interleaving have been widely used in wireless systems. GSM uses a 
constraint length K = 5 code for encoding data packets. IS-95 uses a constraint length K = 9 code for 
data. IEEE 802.11a/g/n use a constraint length K = 7 code as the fundamental channel coding, along 
with puncturing to achieve different rates. 3GPP uses convolutional codes with a constraint length K = 
9 code. 


Other types of codes are becoming popular for wireless communication, built on the foundations of 
iterative soft decoding. Turbo codes [36] use a modified recursive convolutional coding structure 
with interleaving to achieve near-optimal Shannon performance in Gaussian channels, at the expense 
of long block lengths. They are used in the 3GPP cellular standards. Low-density parity check 
(LDPC) codes [217, 314] are linear block codes witha special structure that, when coupled with 
efficient iterative decoders, also achieve near-optimal Shannon performance. They are used in IEEE 
802.1lac and IEEE 802.11ad [259]. Finally, there has been interest in the recently developed polar 
codes [15], which offer options for lower-complexity decoding with good performance, and will be 
used for control channels in 5G cellular systems. 


While channel coding is an important part of any communication system, in this book we focus on 
the signal processing aspects that pertain to modulation and demodulation. Channel coding is an 
interesting topic in its own right, already the subject of many textbooks [200, 43, 352]. Fortunately, it 
is not necessary to become a coding expert to build a wireless radio. Channel encoders and decoders 
are available in simulation software like MATLAB and LabVIEW and can also be acquired as 
intellectual property cores when implementing algorithms on an FPGA or ASIC [362, 331]. 


2.7 Modulation and Demodulation 


Binary digits, or bits, are just an abstract concept to describe information. In any communication 
system, the physically transmitted signals are necessarily analog and continuous time. Digital 
modulation is the process at the transmit side by which information bit sequences are transformed into 
signals that can be transmitted over wireless channels. Digital demodulation is then the process at the 
receive side to extract the information bits from the received signals. This section provides a brief 
introduction to digital modulation and demodulation. Chapter 4 explores the basis of modulation and 
demodulation in the presence of additive white Gaussian noise, and Chapter 5 develops extended 
receiver algorithms for working with impairments. 


There are a variety of modulation strategies in digital communication systems. Depending on the 
type of signal, modulation methods can be classified into two groups: baseband modulation and 
passband modulation. For baseband modulation, the signals are electrical pulses, but for passband 
modulation they are built on radio frequency carriers, which are sinusoids. 


2.7.1 Baseband Modulation 


In baseband modulation, the information bits are represented by electric pulses. We assume the 
information bits require the same duration of transmission, called a bit time slot. One simple method 
of baseband modulation is that 1 corresponds to bit time slots with the presence of a pulse and 0 
corresponds to bit time slots without any pulse. At the receiver, the determination is made as to the 
presence or absence of a pulse in each bit time slot. In practice, there are many pulse patterns. In 
general, to increase the chance of detecting correctly the presence of a pulse, the pulse is made as 
wide as possible at the expense of a lower bit rate. Alternatively, signals can be described as a 
sequence of transitions between two voltage levels (bipolar). For example, a higher voltage level 
corresponds to | and a lower voltage level corresponds to 0. Alternate pulse patterns may have 
advantages including more robust detection or easier synchronization. 


Example 2.8 


We consider two examples of baseband modulation: the non-return-to-zero (NRZ) code and the 
Manchester code. In NRZ codes, two nonzero different voltage levels H and Lare used to represent 
the bits 0 and 1. Nevertheless, if the voltage is constant during a bit time slot, then a long sequence of 
all zeros or all ones can lead to loss of synchronization since there are no transitions in voltage to 
demarcate bit boundaries. To avoid this issue, the Manchester code uses the pulse with transitions that 
are twice as fast as the bit rate. Figure 2.12 illustrates the modulated signals for a bit sequence using 
the NRZ code and the Manchester code. 
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Figure 2.12 Examples of baseband modulation: an NRZ code and the Manchester code 


| The Manchester code 


In principle, a pulse is characterized by its amplitude, position, and duration. These features of a 
pulse can be modulated by the information bits, leading to the corresponding modulation methods, 
namely, pulse-amplitude modulation (PAM), pulse-position modulation (PPM), and pulse-duration 
modulation (PDM). In this book we focus on complex pulse-amplitude modulations, which consist of 
two steps. In the first step, the bit sequence is converted into sequences of symbols. The set of all 
possible symbols is called the signal constellation. In the second step, the modulated signals are 
synthesized based on the sequence of symbols and a given pulse-shaping function. The entire process 
is illustrated in Figure 2.13 for the simplest case of real symbols. This is the well-known baseband 
modulation method called M-ary PAM. 


Baseband 
PAM signal 


Channel Symbol Pulse-shaping 
coder mapping filter 
(Tt) 


Figure 2.13 Block diagram for generating /-ary PAM signals 


Example 2.9 
A pulse-amplitude modulated received signal at baseband may be written in the binary case as 


elt) = S > (-1)4"lgux(t — nT) + v(t), (2.13) 


nh 


where i[7] is the binary information sequence, g,,(¢) is the transmit pulse shape, 71s the symbol 


duration, and v(f) is additive noise (typically modeled as random). In this example, i[”] = 0 becomes 
a “1” symbol, and i[”] = 1 becomes a “—1” symbol. 


In M-ary PAM one of M allowable amplitude levels is assigned to each of the M symbols of the 
signal constellation. A standard M-ary PAM signal constellation C consists of M d-spaced real 
numbers located symmetrically around the origin, that is, 


, or; —1) —d 
Cpam = 4 ———, "5 > 


bn] & 


; 5 (2.14) 


— hn 
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d(M — 1) } 
where d 1s the normalization factor. Figure 2.14 illustrates the positions of the symbols for an 8-PAM 
signal constellation. The modulated signal corresponding to the transmission of a sequence of // real 
symbols using PAM modulation can be written as the summation of uniform time shifts of the pulse- 
shaping filter multiplied by the corresponding symbol 


M 
x(t) =~ skgtx(t — kT), (2.15) 
k=1 
a 
0 


Figure 2.14 An 8-PAM signal constellation. d is the normalization factor. 


where 7 is the interval between the successive symbols and s, is the k"" transmitted symbol, which is 
selected from Cpa, based on the information sequence [7]. 


The transmitter outputs the modulated signals to the wireless channel. Baseband signals, which are 
produced by baseband modulation methods, are not efficient because the transmission through the 
space of the electromagnetic fields corresponding to low baseband frequencies requires very large 
antennas. Therefore, all practical wireless systems use passband modulation methods. 


Example 2.10 


Binary phase-shift keying (BPSK) is one of the simplest forms of digital modulation. Let i[”] denote 
the sequence of input bits, s[7] a sequence of symbols, and x(t) the continuous-time modulated signal. 
Suppose that bit 0 produces A and bit 1 produces —A. 


* Create a bit-to-symbol mapping table. 
Answer: 


0 A 
1 —A 
* BPSK is a type of pulse-amplitude modulation. Let g,,(t) denote the pulse shape and let 
90 
rti= > s[n]gtx(t — nT) (2.16) 
n=—co 
denote the modulated baseband signal. Let the pulse g;,.(t) = Fr for t © [0, 7] and g,,(t) =0 


otherwise. Draw a graph of x(t) for the input i[0] = 1, i[1] = 1, i[2] =0, i[3] =1, with A =3 and 
T = 2. Illustrate for t © [0, 8]. 


Answer: The illustration is found in Figure 2.15. 


-2 0 2 a 6 8 10 
time 
Figure 2.15 Output of a BPSK modulator when the transmit pulse-shaping filter is a rectangular 
function 


2.7.2 Passband Modulation 


The signals used in passband modulation are RF carriers, which are sinusoids. Sinusoids are 
characterized by their amplitude, phase, and frequency. Digital passband modulation can be defined 
by the process whereby any features, or any combination of features, of an RF carrier are varied in 
accordance with the information bits to be transmitted. In general, a passband modulated signal x,(¢) 


can be expressed as follows: 
p(t) = A(t) cos(27f,t + o(t)), (2.17) 


where A(f) is the time-varying amplitude, f, is the frequency carrier in hertz, and (t) is the time- 


varying phase. Several of the most common digital passband modulation types are amplitude-shift 
keying (ASK), phase-shift keying (PSK), frequency-shift keying (FSK), and quadrature-amplitude 
modulation (QAM). 


In ASK modulation methods, only the amplitude of the carrier is changed according to the 
transmission bits. The ASK signal constellation is defined by different amplitude levels: 


Cask = {Aj, Ao,-++ , Ans}. (2.18) 
Therefore, the resulting modulated signal 1s given by 
Lp(t) = A(t) cos(2rf.t + do), (2.19) 


where f, and #9 are constant; 71s the symbol duration time, and 


A(t) = > s|n|gtx(t — nT), 


m=— co 


where s(7) is drawn from Cacx. Figure 2.16 shows the block diagram of a simple ASK modulator. 
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Figure 2.16 Block diagram for generating ASK signals 
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The PSK modulation methods transmit information by changing only the phase of the carrier while 
the amplitude is kept constant (we can assume the amplitude is 1). The modulation is done by the 
phase term as follows: 


Lp(t) = Acos(27f.t + o(t)), (2.20) 
where 0(¢) is a signal that depends on the sequence of bits. Quadrature PSK (QPSK) and 8-PSK are 


the best-known PSK modulations. 8-PSK modulation is often used when there is a need for a 3-bit 
constellation, for example, in the GSM and EDGE (Enhanced Data rates for GSM Evolution) 


standards. 
In FSK modulation, each vector symbol is represented by a distinct frequency carrier f, for k = 1, 
2,° °°, M. For example, it may look like 


Ey(t) = cos(2Qrf,t), O<t < T. (2.21) 


This modulation method thus requires the availability of a number of frequency carriers that can be 
distinguished from each other. 


The most commonly used passband modulation is M-QAM modulation. QAM modulation changes 
both the amplitude and the phase of the carrier; thus it is a combination of ASK and PSK. We can also 
think of an M-QAM constellation as a two-dimensional constellation with complex symbols or a 
Cartesian product of two M/2-PAM constellations. QAM modulated signals are often represented in 
the so-called IQ form as follows: 


Zp(t) = S- Re{s|k]}ox(t — kT) cos(27 f.t) 
k=—o0o 
- S- Im{s[k] }oex(t — kT) sin(27 ft), (2.22) 
k=—00 
= z(t) cos(27f.t) — rQ(t) sin(27f,t), (2.23) 


where x(t) and x((t) are the M/2-PAM modulated signals for the inphase and quadrature components. 
M-QAM is described more extensively in Chapter 4. 


Figure 2.17 shows the block diagram of a QAM modulator. M-QAM modulation is used widely in 
practical wireless systems such as IEEE 802.11 and IEEE 802.16. 
x\(f) 


Xp(b) 


Figure 2.17 Block diagram for generating QAM signals 


Example 2.11 


This example considers BPSK with passband modulation, continuing Example 2.10. For BPSK, the 
passband signal is denoted as 


Lp(t) = x(t) cos(27 f,t) (2.24) 


where x(t) 1s defined in (2.16). BPSK is a simplification of M-QAM with only an inphase 
component. The process of creating x,(¢) from x() is known as upconversion. 


* Plot x,(¢) for ¢ © [0, 8] and interpret the results. Choose a reasonable value of f, so that it is 
possible to interpret your plot. 
Answer: Using a small value of f, shows the qualitative behavior of abrupt changes in Figure 


2.18. The signal changes phase at the points in time when the bits change. Essentially, 
information is encoded in the phase of the cosine, rather than in its amplitude. Note that with 
more complicated pulse-shaping functions, there would be more amplitude variation. 
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Figure 2.18 Output of a BPSK modulator with a square pulse shape from Example 2.11 when the 
carrier frequency is very small (just for illustration purposes) 


¢ The receiver in a wireless system usually operates on the baseband signal. Neglecting noise, 
suppose that the received signal is the same as the transmit signal. Show how to recover x(f) 
from x,(¢) by multiplying by cos(2zf,t) and filtering. 


Answer: After multiplying x,(¢) by cos(27f,/), 
z(t) cos(27 f,t) = x(t) cos” (27 fet) (2.25) 


(t) 


2 


bs 


(1 + cos(47 f,t)). (2.26) 


The carrier at 2/, can be removed by a lowpass-filtering (LPF) operation, so that the output of the 
filter is a scaled version of x(t). 


2.7.3 Demodulation with Noise 


When the channel adds noise to the passband modulated signal that passes through it, the symbol 
sequence can be recovered by using a technique known as matched filtering, as shown in Figure 2.19. 
The receiver filters the received signal y(t) with a filter whose shape 1s “matched” to the transmitted 
signal’s pulse shape g,,(¢). The matched filter limits the amount of noise at the output, but the 


frequencies containing the data signal are passed. The output of the matched filter is then sampled at 
multiples of the symbol period. Finally, a decision stage finds the symbol vector closest to the 
received sample. This demodulation procedure and the mathematical definition of the matched filter 
are covered in detail in Chapter 4. In practical receivers this is usually implemented in digital, using 
sampled versions of the received signal and the filter. 

t=kT 
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Figure 2.19 The demodulation procedure based on atehed filtering 


Example 2.12 

In this example we consider the baseband BPSK signal from Example 2.10. A simple detector is 
based on the matched filter, which in this example is the transmit rectangular pulse g,,(t) in Example 
2,10. 


* Show that tli x(t) gix(t — kT )dt = s[k] where x(4) is given in (2.16). 


Answer: 


kT+T kTiT <x 
/ r(t)9tx(t — kT)dt = / S— s[n|gix(t — nT) gix(t —kT)dt (2.27) 
LRT: JT 


n=—co 


x kT+T 
S- / s[n|gtx(t — nT )grx(t —kT)dt (2.28) 
n=—oo0 VAT 
kT +T 
= | slklgtx(t — kT )gix(t —kT)dt + 
kT 


kT+T eS 
s[n] dtx(t—F Jgix(t — kT )dt (2.29) 


= Ts{k] (2.31) 


The original signal can be recovered by dividing by T. 


¢ Plot the signal at the output of the matched filter, which is also a rectangular pulse, int © [—2, 
10], when x(f) is the baseband BPSK signal from Example 2.10 corrupted by noise. Plot the 
sampled received signal at t = 1, 3, 5, 7. Interpret the results. 


Answer: Figure 2.20(a) shows the output of the matched filter for the signal x(t) in Example 
2.10 corrupted by additive noise. The samples of the received signal at multiples of the symbol 
period are represented in Figure 2.20(b) to show the output of the sampling stage. Then, the 
detector has to find the closest symbol to this received vector. Despite the noise, the detector 
provides the right symbol sequence [—1, —1, 1, -1]. 


3 4 
Dy 3 
2 
1 
1 
0 0 
= _{ 
-2 -2 
3 -3 
-4 
-2 0 2 4 6 8 10 -2 0 2 4 6 8 10 
time time 


(a) (b) 
Figure 2.20 (a) Output of the matched filter for the signal in Example 2.10 corrupted by noise; (b) 
output samples of the sampling stage after the matched filter (circles) and transmitted symbols (x- 
marks) 


When demodulating a general passband signal, we need to extract the symbol information from the 
phase or the amplitude of an RF carrier. For M-QAM, the demodulator operates with two basic 
demodulators in parallel to extract the real and imaginary parts of the symbols. This demodulator can 
be implemented as shown in Figure 2.21. The passband received signal y(f) 1s corrupted by noise as 
in the baseband case, and the matched filters remove noise from both components. 


t=nT 
Matched 
filter 


Re {S[n]} 


Detector 


t=nT 
sin(2xf,6 Im {Sn} 
Matched — 
w filter 


Figure 2.21 Block diagram of an M-QAM demodulator 


In practice, however, the channel output signal is not equal to the modulated signal plus a noise 
term; signal distortion is also present, thus requiring more complicated demodulation methods, which 
are briefly described in Section 2.7.4 and discussed in detail in Chapter 5. 


2.7.4 Demodulation with Channel Impairments 


Real wireless channels introduce impairments besides noise in the received signal. Further, other 
functional blocks have to be added to the previous demodulation structures to recover the transmitted 
symbol. Moreover, practical realizations of the previous demodulators require the introduction of 
additional complexity in the receiver. 


Consider, for example, the practical implementation of the demodulation scheme in Figure 2.19. It 


involves sampling at a time instant equal to multiples of the symbol duration, so it is necessary to find 
somehow the phase of the clock that controls this sampling. This is the so-called symbol timing 
recovery problem. Chapter 5 explains the main approaches for estimating the optimal sampling phase 
in systems employing complex PAM signals. 

In a practical passband receiver some additional problems appear. The RF carriers used at the 
modulator are not exactly known at the receiver in Figure 2.21. For example, the frequency generated 
by the local oscillator at the receiver may be slightly different than the transmit frequency, and the 
phase of the carriers will also be different. Transmission through the wireless channel may also 
introduce a frequency offset in the carrier frequency due to the Doppler effect. Figure 2.22 shows the 
effect of a phase offset ina QPSK constellation. Because of these variations, a practical M-QAM 
receiver must include carrier phase and frequency estimation algorithms to perform the demodulation. 
Chapter 5 describes the main approaches for phase and frequency offset correction for single-carrier 
and OFDM systems. 


(a) (b) 
Figure 2.22 (a) QPSK constellation; (b) effect of phase offset in a QPSK constellation 


Example 2.13 


Consider BPSK with passband modulation, continuing Example 2.11. Suppose that during the 
downconversion process, the receiver multiplies by cos(2z(f, + €)t), where ¢ # 0 represents the 


frequency offset. Show how the received signal, in the absence of noise, is distorted. 


Answer: 

z,(t) = LPF{x,(t) cos(27(f, + €)t)} (2.32) 
= LPF {x(t) cos(27 f.t) cos(27(f. + €)t)} (2.33) 
— LPF {.x(t) = (cos(2r fet — 2rn(f. + €)t) + cos(Qrf.t + 27(f. + €)t))} (2.34) 
= LPF{2 (t) = (cos(2met) + cos(27(2f, + €)t))} (2.35) 


Because of the term cos(27¢?) in the demodulated signal, reconstruction by simple filtering no longer 
works, even for small values of ¢. 


As described in Section 2.3, the wireless channel also introduces multipath propagation. The input 
of the demodulator is not the modulated signal x(t) plus a noise term, but it includes a distorted 
version of x(t) because of multipath. This filtering effect introduced by the channel has to be 
compensated at the receiver using an equalizer. Chapter 5 introduces the basis for the main channel 
equalization techniques used in current digital communication systems. 


2.8 Summary 
* Both analog and digital communication send information using continuous-time signals. 
¢ Digital communication is well suited for digital signal processing. 


¢ Source coding removes the redundancy in the information source, reducing the number of bits 
that need to be transmitted. Source decoding reconstructs the uncompressed sequence either 
perfectly or with some loss. 


¢ Encryption scrambles data so that it can be decrypted by the intended receiver, but not by an 
eavesdropper. 


¢ Channel coding introduces redundancy that the channel decoder can exploit to reduce the effects 
of errors introduced in the channel. 


¢ Shannon’s capacity theorem gives an upper bound on the rate that can be supported by a channel 
with an arbitrarily low probability of error. 

¢ Significant impairments in a wireless communication system include additive noise, path loss, 
interference, and multipath propagation. 

¢ Physically transmitted signals are necessarily analog. Digital modulation is the process by 
which bit sequences containing the information are transformed into continuous signals that can 
be transmitted over the channel. 


¢ Demodulating a signal consists of extracting the information bits from the received waveform. 


¢ Sophisticated signal processing algorithms are used at the receiver side to compensate for the 
channel impairments in the received waveform. 


Problems 

1. Short Answer 
(a) Why should encryption be performed after source coding? 

(b) Why are digital communication systems implemented in digital instead of completely in 
analog? 

2. Many wireless communication systems support adaptive modulation and coding, where the 
code rate and the modulation order (the number of bits per symbol) are adapted over time. 
Periodic puncturing of a convolutional code is one way to adapt the rate. Do some research and 
explain how puncturing works at the transmitter and how it changes the receiver processing. 

3. Adiscrete source generates an information sequence using the ternary alphabet {a, b, c} with 
probabilities P(s = a) = 1/4 P(s = 6) = 1/3 ang P(s = c) = 5/12 How many bits per 
alphabet letter are needed for source encoding? 

4. Look up the stream ciphering used in the original GSM standard. Draw a block diagram that 
corresponds to the cipher operation and also give a linear feedback shift register 
implementation. 

5. Determine the CRC code used in the WCDMA (Wideband Code Division Multiple Access) 
cellular system, the first release of the third-generation cellular standard. 

(a) Find the length of the code. 
(b) Give a representation of the coefficients of the CRC code. 
(c) Determine the error detection capability of the code. 

6. Linear Block Code Error control coding is used in all digital communication systems. 
Because of space constraints, we do not cover error control codes in detail in this book. Linear 
block codes refer to a class of parity-check codes that independently transform blocks of k 


information bits into longer blocks of m bits. This problem explores two simple examples of 
such codes. 


Consider a Hamming (6,3) code (6 is the codeword length, and 3 1s the message length in 
bits) with the following generator matrix: 


G= eS 6 
110 cee 
"ae 
a 


(a) Is this a systematic code? 


(b) List all the codewords for this (6,3) code. In other words, for every possible binary input of 
length 3, list the output. Represent your answer in table form. 


(c) Determine the minimum Hamming distance between any two codewords in the code. The 
Hamming distance is the number of bit locations that differ between two locations. A code 
can correct c errors if the Hamming distance between any two codewords is at least 2c + 1. 

(d) How many errors can this code correct? 

(ec) Without any other knowledge of probability or statistics, explain a reasonable way to 
correct errors. In other words, there is no need for any mathematical analysis here. 

(f) Using the (6,3) code and your reasonable method, determine the number of bits you can 
correct. 

(g) Compare your method and the rate 1/3 repetition code in terms of efficiency. 

7. HDTV Do some research on the ATSC HDTV broadcast standard. Be sure to cite your sources 
in the answers. Note: You should find some trustworthy sources that you can reference (e.g., 
there may be mistakes in the Wikipedia article or it may be incomplete). 

(a) Determine the types of source coding used for ATSC HDTV transmission. Create a list of 
different source coding algorithms, rates, and resolutions supported. 

(b) Determine the types of channel coding (error control coding) used in the standard. Provide 
the names of the different codes and their parameters, and classify them as block code, 
convolution code, trellis code, turbo code, or LDPC code. 

(c) List the types of modulation used for ATSC HDTV. 

8. DVB-H Do some research on the DVB-H digital broadcast standard for handheld devices. Be 
sure to cite your sources in the answers. Note: You should find some trustworthy sources that 
you can reference (e.g., there may be mistakes in the Wikipedia article or it may be incomplete). 
(a) Determine the types of source coding used for DVB-H transmission. Create a list of 

different source coding algorithms, rates, and resolutions supported. 

(b) Determine the types of channel coding (error control coding) used in the standard. Provide 
the names of the different codes and their parameters, and classify them as block code, 
convolution code, trellis code, turbo code, or LDPC code. 

(c) List the types of modulation used for DVB-H. 

(d) Broadly speaking, what is the relationship between DVB and DVB-H? 

9. On-off keying (OOK) 1s another form of digital modulation. In this problem, you will illustrate 
the operation of OOK. Let i[n] denote the sequence of input bits, s[”] a sequence of symbols, 
and x(t) the continuous-time modulated signal. 

(a) Suppose that bit 0 produces 0 and bit 1 produces A. Fill in the bit mapping table. 


(b) OOK is a type of amplitude modulation. Let g(t) denote the pulse shape and let 
zith= cm s|n|g(t — nT) (2.38) 


n 


denote the modulated baseband signal. Let g(t) be a triangular pulse shape function defined 


as 


jt—+| . -in 7 
ome =e 2 = eee Wasa 
alt) 1-22! itt € [0,7], om 


0, otherwise. 


Draw a graph of x(¢) for the input 7[0] = 1, i[1] = 1, i[2] = 0, i[3] = 1, with A =4 and T= 2. 
You can illustrate for t © [0, 8]. Please draw this by hand. 

(c) Wireless communication systems use passbands signals. Let the carrier frequency f, = 
1GHz. For OOK, the passband signal is 


p(t) = cos(2aft)rtt). (2.40) 


Plot x(t) for the same range as the previous plot. Be careful! You might want to plot using a 
computer program, such as MATLAB or LabVIEW. Explain how the information is encoded 
in this case versus the previous case. The process of going from x(¢) to x,(¢) is known as 


upconversion. Illustrate the qualitative behavior with a smaller value of fo. 

(d) The receiver ina wireless system usually operates on the baseband signal. Neglect to noise, 
suppose that the received signal is the same as the transmit signal. Show how to recover x(f) 
from x,(¢) by multiplying x,(¢) by cos(27/,¢) and filtering. 

(e) What happens if you multiply instead by cos(27(f, + €)t) where ¢ # 0? Can you still recover 
x(t)? 


(f) A simple detector is the matched filter. Show that fe " x(t) g(t — kT)dt = 4 s[k} 
(g) What happens if you compute | ae ax(t)g(t —kT —r)dt, where t © (0, 7)? 


10. M-ary PAM is a well-known baseband modulation method in which one of M allowable 
amplitude levels is assigned to each of the VM vector symbols of the signal constellation. A 
standard 4-PAM signal constellation consists of four real numbers located symmetrically 


around the origin, that 1s, 
; a 2 ie Bacal 
C4 PAM = OY te (2.41) 


. = 


Since there are four possible levels, one symbol corresponds to 2 bits of information. Let 
i[n] denote the sequence of input bits, s[”] a sequence of symbols, and x(t) the continuous-time 
modulated signal. 


(a) Suppose we want to map the information bits to constellation symbols by their 
corresponding numeric ordering. For example, among the four possible binary sequences of 
length 2, 00 is the smallest, and among the constellation symbols, — 3 is the smallest, so we 


map 00 to — 3. Following this pattern, complete the mapping table below: 


(b) Let g(t) denote the pulse shape and let 
a(t)'= a s|njg(t — nT) (2.42) 
rn 


denote the modulated baseband signal. Let g(t) be a triangular pulse shape function defined 
as 


_glt-al ; E€ [0,T 
ay={ 2, ift¢ [0,7], (2.43) 


0. otherwise. 


By hand, draw a graph of x(t) for the input bit sequence 11 10 00 01 11 01, with T= 2. You 
can illustrate for t+ © [0, 12]. 

(c) In wireless communication systems, we send passband signals. The concept of passband 
signals is covered in Chapter 3. Let the carrier frequency be f,; then the passband signal is 
given by 


Lp(t) = cos(27 f.t)x(t). (2.44) 


For cellular systems, the range of f, is between 800MHz and around 2GHz. For illustration 
purposes here, use a very small value, f, = 2Hz. You might want to plot using a computer 
program such as MATLAB or LabVIEW. Explain how the information is encoded in this case 
versus the previous case. 

(d) The receiver ina wireless system usually operates on the baseband signal. At this point we 
neglect noise and suppose that the received signal is the same as the transmit signal. Show 
how to recover x(t) from x,(¢) by multiplying x,(¢) by cos(2a/,t) and filtering. The process 
of recovering a baseband signal froma passband signal is known as downconversion. 

(e) What happens if you multiply instead by cos(27(f, + €)t), where ¢ # 0? Can you still recover 
x(t)? This is called carrier frequency offset and 1s covered in more detail in Chapter 5. 


(f) A simple detector is the matched filter. Show that i x(t)g(t — kT )dt = 4s[k} 


(g) What happens if fee ax(t)g(t —kT —r)dt: where t © (0, 7)? This is called symbol 
timing offset and is covered in more detail in Chapter 5. 


Chapter 3. Signal Processing Fundamentals 


A key feature of this book is the signal processing approach to wireless communications. This chapter 
reviews the fundamentals of signal processing, providing critical mathematical background that is 
used in subsequent chapters. We start with a review of concepts from signals and systems, including 
continuous-time and discrete-time signals, linear time-invariant systems, Fourier transforms, 
bandwidth, the sampling theorem, and discrete-time processing of continuous-time signals. These 
basic signal concepts are used extensively in this book. Then we explain concepts from statistical 
signal processing, providing background on probability, random processes, wide-sense stationarity, 
ergodicity, power spectrum, filtered random processes, and extensions to multivariate processes. The 
exposition focuses on just the main items from statistical signal processing that are needed in later 
chapters. Next we introduce passband signals and the associated concepts of the complex baseband 
equivalent, the complex envelope, and the complex baseband equivalent system. This leads to an 
important discrete-time abstraction of wireless communication channels. Then we provide some 
mathematical background on two additional topics. We review key results on multirate signal 
processing, which are used in subsequent chapters to implement pulse shaping in discrete time. 
Finally, we provide some background on estimation, including linear least squares, maximum 
likelihood, and minimum mean squared error estimators. We make extensive use of linear least 
squares to solve problems related to channel estimation and equalization. 


3.1 Signals and Systems 


In this section we introduce key concepts from signals and systems, which are essential for a signal 
processing approach to wireless communication. First we introduce continuous-time and discrete- 
time signal notations. Then we review linear time-invariant systems, which are used to model the 
effects of filtering and multipath in the wireless channel. Next we summarize the continuous-time, 
discrete-time, and discrete Fourier transforms. There are many cases where looking at the frequency 
domain is useful, including to determine the bandwidth of a signal or finding an easy way to equalize 
a channel. We use the Fourier transform to define different notions of bandwidth. Next we review the 
Nyquist sampling theorem, which forms the fundamental connection between bandlimited continuous- 
time and discrete-time signals. Finally, we use the sampling theorem to develop a discrete-time linear 
time-invariant equivalent of a continuous-time linear time-invariant system. This last result is used to 
develop a discrete-time equivalent of the continuous-time communication system model, leading to 
the wide use of digital signal processing in communications. 


3.1.1 Types of Signals and Notation 
At different points in the digital communication system there are different kinds of signals. We take 
some time here to describe some of these signals and also to introduce notation that is used in the rest 
of the book. A simple taxonomy of signals is based on the discreteness of a signal: 
¢ x(t): A continuous-time signal where ¢ is a real variable that indicates time, and x(t) is the value 
of the signal at that time 
¢ x[n]: A discrete-time signal where 7 is an integer that indicates discrete time, and x[7] is the 
value of the signal at that discrete time 
* xXq[n]: A digital signal where n 1s an integer that indicates discrete time, and xQ[7] is the 


quantized value of the signal at that discrete time 
Quantization maps a continuous variable to one of a finite number of levels. These signal levels 
can be represented using binary notation from which we get the notion of digital information. Figure 
3.1 shows examples of all these types of signals. 
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Figure 3.1 Different types of signals according to their discreteness: (a) continuous-time signal; 
(b) discrete-time signal; (c) digital signal quantized to four levels of [-1, 0, 1, 2] corresponding to 
2 bits of quantization 


In all the cases mentioned above, the signal may be real or complex. A complex signal can be 
thought of as the sum of two real signals, for example, x[n] = a[n] + jb[n]. The signals transmitted in 
any communication system are always real. Complex signals, though, are used to more efficiently 
represent passband signals, resulting in more compact expressions for the signal and channel. 
Complex discrete-time signals are used to represent bandlimited signals by their sampled 
equivalents, allowing the ready application of DSP. 


From a mathematical perspective, this book deals primarily with discrete-time signals. In most 
DSP, FPGA, or ASIC implementations, however, the signals are actually digital. This requires 
designing special algorithms to account for the format in which the quantized signals are stored and 
manipulated, which is important but beyond the scope of this book [273, Section 7.3.3]. 


3.1.2 Linear Time-Invariant Systems 


Many of the challenges in wireless communications are a result of temporal dispersion of the 
transmitted signal due to multipath propagation in the wireless channel and the frequency selectivity 
of analog filters in the analog front end. Multipath propagation results from the plurality of paths 
between transmitter and receiver resulting from reflection, penetration, diffraction, and scattering. 
Depending on the carrier frequency and the bandwidth of the signal, the signals on these paths can 
arrive with different delays, attenuations, and phase shifts, resulting in dispersive distortion of the 
transmitted signal. Analog filters are widely used in the analog front end in both the transmitter and 
the receiver, to help meet the transmit spectral mask, correct for imperfect digital-to-analog 
conversion, remove intermodulation products, and eliminate noise and interference from other bands. 
Digital filters may be used to supplement analog filtering. These aggregate effects are known 
collectively as the channel. 


Linear time-varying systems are good models for wireless systems. Linearity is reasonable to 
describe far-field propagation. The model is time varying because sources of multipath can change 
over time because of the motion of the transmitter, receiver, or objects in the environment. Given the 
linear time-varying assumption, and in the absence of noise, the received continuous-time signal y(t) 
is related to the transmitted continuous-time signal x(¢) through a superposition integral between the 
time-varying impulse response /,(t, T) via 


y(t) = | | h.(t, 7)a(r)dr. (3:1) 


‘2 @) 


The time-varying impulse response /,(¢, t) captures all the effects of multipath and filtering in the 
transmission path as well as the variation of the transmission path over time. 


Most wireless communication systems are designed so that the time variation of the channel can be 
neglected in the design of signal processing algorithms, at least during a short interval of time. The 
main reason is that dealing with time-varying channels /,(¢, t) is challenging for the receiver: they 


can be difficult to estimate, track, and equalize. Consequently, most systems are designed so that 
information is sent in short bursts, which are shorter than the coherence time of the channel (a 
measure of how fast the channel varies, as discussed further in Chapter 5). Therefore, for purposes of 
building a mathematical model, wireless communication channels are modeled as linear time- 
invariant (LTT) systems, which is known as the time-invariance assumption. All the models 
considered in this book are accurate for linear time-invariant systems where the channel is 
approximately constant over a period long enough that the channel can be estimated and compensated 
at the receiver. 


Under the LTI assumption, the channel is characterized by its impulse response /,(t). For an LT] 
system, the input and output in (3.1) are related through the well-known convolution integral 


i > @) 
y(t) = | h.(r)a(t — 7)dr. (3.2) 
J —oo 

This gives a mathematically tractable relationship between the input and output, which has been 
widely studied [249, 186]. In discrete time, the convolution integral becomes a sum. Given input x[7], 
output y[7], and an LTI system with impulse response /A[n], the input and output are related as 


y(n] = i. hléla|[n — €]. (3.3) 


f=—-& 


Discrete-time LTI systems are used to model the entire communication system, by virtue of the 
bandlimited property of complex baseband signals (see Section 3.3), the sampling theorem (see 
Section 3.1.5), and an equivalence between continuous-time and discrete-time processing of signals 
(see Section 3.1.6). 


3.1.3 The Fourier Transform 


The frequency domain is incredibly useful when engineering wireless communication systems. For 
example, our entire notion of bandwidth is built from understanding signals and systems in the 
frequency domain. It is also natural to look into the frequency domain, since the convolution becomes 
multiplication. Real-world signals can be viewed in the frequency domain through a spectrum 
analyzer. 


In this section, we review some relevant Fourier transform relationships, including transforms of 
both continuous-time and discrete-time signals. Tables of transforms of common signals and 
properties are provided for each case. We also provide several examples involving both the direct 
calculation of transforms and applications of their properties. 


First, we consider the continuous-time Fourier transform (CTFT) of a well-behaved signal.! A 
continuous-time function x(t) and its Fourier transform <(f) are related by the synthesis and analysis 
equations, 

1. There are several conditions for a Fourier transform to exist: that the integral of |f(4)| from —co to oo exists, there are a finite number 


of discontinuities in f(¢), and f(t) has bounded variations [50]. All physically realizable signals have Fourier transforms. Discussion 
of more technical aspects, such as Lipschitz conditions, is beyond the scope of this book. 


“OC 
Analysis x(f)= | a(t)e J? Ft dt (3.4) 
J —OO 
_ ; 
Synthesis x«(t) = / x(f)el?*Ftdf. (3.5) 
— OO 


We often use the shorthand <(/) = F'{x(t)} and x(t) = F {x(f)}. 


While the synthesis and analysis equations may be applied directly, it is convenient to leverage the 
common properties of the Fourier transform in Table 3.1 and the basic Fourier transforms in Table 
3.2. An example of the direct application of the analysis equation is provided in Example 3.1. The 
application of transform pairs and properties is shown in Example 3.2 and Example 3.3. 


| ita iD 
y(t) y(f) 
Linearity ax(t) + by(t) ax(f) + by(f) 
Time shifting z(t — to) e—)2 feto x/f) 
Frequency shifting el2™ fot y(t) 
Conjugation Sad 69 
Time reversal x(—t) 
Time scaling x(at) 
Convolution a(t) * y(t) = f° a(r)y(t — r)dr 
Autocorrelation x(t) * x*(—t) 
Multiplication x(t)y(t) x(f) *y(f) = f°, X(@Y (fF — 0)d0 
Differentiation in time | © wa (2m bw gee 1 
Integration Tne. T)dr jisy x(f) +4 5X (0)d(f) 
Differentiation in ar (+)" ax) 
frequency 
Modulation (1) a(t)el27 fot x(f — fo) 
Modulation (2) a(t) cos(2z7 fot) 5 [x(f — fo) + X(f + fo)] 
Modulation (3) x(t) sin(27 fot) a [x(f — fo) —X(f + fo)] 
x(f) = x*(-f) 
Re {x(f)} = Re{x(—f)} 
Jonjugate symmetry a(t) is real Im { x(f)} = —Im{x(—f)} 
for real signals Ix(f)| =|x(-f)| 
&£x(f) = —4x(—-f) 
Symmetry for real and | z(t) real and even x(f) real and even 
even signals 
Symmetry for real and | x(t) real and odd x(f) purely imaginary and odd 
odd signals 
te(t) = Ev{a(t)}  [a(t)real] Re { x(f)} 


Even-odd to(t) = Od{x(t)} — [x(t)real] jlm {x(f)} 


decomposition for real 

signals 

Duality x(t) <——> x(f) x(t) —+ «(—f) 

Parseval’s theorem ee meen dt = f-| x(f)/? df 
foes oan = = Soe ' ee 

(e(t) and y(t) real) (ak sue: =f x(fy(fas 


Table 3.1 Continuous-Time Fourier a Relationships 


Time-Domain Signal x(t) 


Impulse 

DC 

Complex exponential 
Cosine 

Sine 

Unit step 

Sign 


Impulse train 


Fourier series 


Rectangle pulse 


Triangle pulse 


Sinc pulse 
Sine? pulse 
Exponential pulse 


Decaying exponential 


Linear decaying 


d(t) 

1 

el2% fot 

cos(27 fot + 0) 
sin(27 fot + 0) 


1 ¢t>0 
ae ) t<0 


1 t>0 
sett) = | t<0 


5 6(t—kT) 
KkK=—oo 


oO . 
pe apz.ei27 fokt 
k=—090 


where a, = $ fp e(the 327 foktde 


- 
1 ei<e 
0 elsewhere 


\t 
l-% 


rect (4) = 


Nb) =4 
sinc (Wt) = 
sinc? (Wt) 
e—ltl with a > 0 
e~%tu(t) with Re {a} > 0 
te—*'u(t) with Re {a} > 0 


sin(rWt) 
awWt 


te %tu(t) with Re {a} >0 


(n—1)! 
i 
+ 


lt] << W 
0 elsewhere 


Frequency-Domain Signal x(/) 


5 [el?d(f — fo) + e15(f + fo)] 


j 


[e!°5(F — fo) — e1°5(F + fo)] 


— 7m) 


> and(f —nfo) 


n=— oo 


Tare (fT) = snirit T) 


Ww 


sinc? (fW) 


2a 
a--+ (2x f)? 
i 


anand 


(a+j2nf)* 


{a 


eae. 2 
bj2nf)” 


—jmsgn(f) 


Table 3.2 Continuous-Time Fourier Transform Pairs 


Example 3.1 
Compute the Fourier transform of 


t 1 fte<2 
rect { — } = ie 26 
( Ye ) 0 elsewhere (3.6) 


using the analysis equation to verify the result in Table 3.2. 
Answer: Through direct computation, 


t — t + 
a {rect (=)} = / rect (=) e Jr ftat (3.7) 


T/2 
=: / ein fay (3.8) 
-T/2 
= — T/2 
=z Qe seas (3.9) 
ey | -T/2 
el2nfFt == ei2n fF 
- ee 3.10 
j2nf ( ) 
_ sin(n fT) (3.11) 
wf 
Example 3.2 
Compute the Fourier transform of 
t 1-f est 
A (=) = Tf el S (3.12) 
f 0 elsewhere 
using the fact that A(-) = zrect(#) * rect( 7): 


Answer: The Fourier transform is 


F A (=)} = {greet @ aaa (=) } (3.13) 
aF {rect (=) } F {rect (=) } (3.14) 


- sin?(a fT) a se 
= “APT (3.15) 


Example 3.3 
Compute the inverse CTFT of x(2f) cos(27ftp), where tp 1s constant. 


Answer: Exploit the properties 


Foxe yy = s(t/2) (3.16) 
and 
F—" {y(f) cos( (2nfto)} = 5 [F {y(f)e?Fto} + F—} fy( fle nF }) (3.17) 
- ; [y(t — to) + y(t + to)]. eae) 
This gives 
F~* {x(2f) cos(27 fto)} = ; [a(t — to)/2) + 2((t + to)/2)]. (3.19) 


The CTFT’s analysis and synthesis equations are strictly defined only for well-behaved functions. 
Without going into details, this applies to functions where the integrals in (3.5) and (3.4) exist. 
Periodic functions do not fall under the umbrella of well-behaved functions since they contain infinite 
energy. This means that an infinite integral over a periodic function is unbounded. Unfortunately, we 
send information on RF carriers, which are sines and cosines; thus periodic functions are a part of our 
wireless life. A way around this is to recall the definition of the Fourier series and to use our poorly 
defined friend the Dirac delta function. 

Consider a periodic signal x(t). The period 7'> 0 is the smallest real number such that x(t) = x(t + 


T) for all ¢. The continuous-time Fourier series (CTFS) of a (well-behaved) periodic signal x(t) can 
be defined as 


1 z ra + 2a y 
Analysis x|n| = 7 [ g(t)e 3 TF dt (3.20) 
J0 
OO 
Synthesis «(t) = a x [nj Tr", (3.21) 
n=—co 


Note that the CTFS creates as an output a weighting of the fundamental frequency of the signal 
exp(j27/7) and its harmonics. Using the CTFS, we can define the Fourier transform of a periodic 
signal as 


Analysis x(f) = » x [nd (1 — 7”) (3.22) 


Synthesis z(t) = x( fyel?Ftadf. (3.23) 


We use the notion of “define” here because the 6 function is a generalized function and not a proper 
function. But the intuition from (3.22) is accurate. In a spectrum analyzer, the Fourier spectrum of a 
periodic signal has a comblike structure. 


A example application of the application of the CTFS to compute the Fourier transform of a 
periodic signal is provided in Example 3.4. Generalization of this example gives the transforms for 
the sine and cosine in Table 3.2. 


Example 3.4 
Compute the Fourier transform of x(t) = exp(27/,0). 


Answer: Note that the period of this signal is 7 = 1/f,. From (3.20): 


LP" eae wee | 
x[n] = = | etn int eI ay (3.24) 
LT’ Jo 
‘ma ee 
= if Te dt (3.25) 
0 
= §[n — 1). (3.26) 


Then from (3.22): 


Note in (3.28) we use both the Kronecker delta 5(n) and the Dirac delta 6(/). 


Because most of the signal processing in a communication system is performed in discrete time, we 
also need to use the discrete-time Fourier transform (DTFT). The DTFT analysis and synthesis pair 
(for well-behaved signals) is 


Analysis x(e!?"/) = S- a[njeI?7F" (3.30) 
n=—co 
1/2 

Synthesis zin| = | x(el27 In) olan ing Ff, (3.31) 
J—1/2 


The DTFT is also often written in radians using = 2zf, in other books. The DTFT x(e?"/”) is 
periodic inf. This implies that the DTFT is unique only on any finite interval of unit length, typically f 
© (-1/2, 1/2] or f © [0, 1). 


Transform properties are illustrated in Table 3.3, and relevant transforms are presented in Table 
Sf: 


z[n] x(el27f) 

pi ror 

Linearity ax|n] + by[n] ax(el@™F) + by(el2*F) 

Time shifting z[n — no] e277 fro x(el27f) 

Frequency shifting el2r for gin] x(ei27(f—fo)) 

Conjugation x*[n] x *(e7J27f) 

Time reversal z[—n] x(eJ27Ff) 

Conjugate symmetry | 2[n]} real x(el27f) = x*(e—i27F) 

Even symmetry z{n] = 2[—n] x(el2™f) = x(e—J27F) 

Odd symmetry z[n] = —2[—n] x(el2*f) = —x(e—J27f) 

Convolution z[n] * y[n] = > z[mly[n —m] — x(e)2*F )y(el2*F) 


Autocorrelation a(n] * 2*(—n] |x(el27f)|2 
Multiplication x[n}y[n] nye x(eh2*9)y (el2*(F-9) )d@ 
Multiplication by n nx[n] = F x(el27F) 

oo 


Sum > an] x(e)270) 


n=— oO 
Value at origin z(0] fin x(el2"F df 
Modulation (1) a[njei2™ fon (ean fo)) 
Modulation (2) z[n] cos(27 fon) 5 [ x(ei27(F— fo) ) e x(el27(F+fo))] 
Modulation (3) z[n] sin(27 fon) 7 [ x(el2*(F-Fo)) — x(el2*(F+fo))] 


—1/2 


, . 2 1/2 jan fy|2 
Parseval’s theorem > len =. | x(el2"F)|" af 


n—-—co 


Table 3.3 Discrete-Time Fourier Transform Relationships 


Function Name Sequence Fourier Transform 


Impulse 


DC 


[nl] 
d[n — no] 


1 (-coo <n < ow) 


Complex exponential | e/27/o” 


Cosine 


Unit step 


Window 


Sinc 


os(27 fon + @) 
sin(27 fon + @) 


a” u{n] 
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1 |InjxsM 
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-W sin|2 >\ 1 j 
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sin 2r fp 
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2 0(f — fo —k) 
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D4 [eer - fom) 
te Ble o 
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mrt} Do O(f —k) 
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sin(27(n+4)f) 


sin(awf) 


Tia —ao rect (55) 
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1—2r cos 27 fpe 327) 472e—)474 


Table 3.4 Discrete-Time Fourier Transform Pairs 


Example 3.5 
The frequency response of an ideal lowpass filter with cutoff frequency f, is given by 


j2nf\ 1 If | < fe ie 
h(ei2"f) = ‘ ye (3.32) 


Determine its discrete-time impulse response. 
Answer: Applying (3.31): 


1/2 Z 7 
fiali= J h(ci2*S )ei2n ing f (3.33) 
—i/2 
fo 
=| et ing f (3.34) 
— fi 
t x : 
= |——e!?7Jn (3.35) 
j2mn _f. 
og BBC) (3.36) 
TT 


Note that since h[n] is not zero for n < 0, the filter is noncausal. 


Example 3.6 
Compute the DTFT of x[n] cos(2af,n) cos(2af5n), where f; and 5 are constant. 


Answer: Using the properties in Table 3.3, 
F {x|n| cos(27 fn) cos(27 fon)} = 


S |F {x[n] cos(2m fyn)} (e!?*4—-J2)) + F {a[n] cos(2x fin)} rih)| (3.37) 
and 


F {x[n] cos(2m fyn)} (el?*™ FFI2)) = = | x (ee Fth-h)) 4 x(sntteain)| . (3.38) 


DPolRe 


Therefore, 


F {x|n] cos(27 fin) cos(2m fon)} = 


1 | x(einF-Fe-Fa)y os x(el2"F—-fatfi)) A. x (eft fa—fi)) oa x(ePn tt ieth)| (3.39) 


Example 3.7 
Compute the inverse DTFT of x(e?*/))(1 — je?™ SD) + x(eP2U\(1 + je?" 1), where f; is 
constant. 

Answer: Using Table 3.3, 


gal { x(eltn ih ) r x(el2m ith )} _ 22x[n] cos(27 fin), (3.40) 
then it also follows that 
Ee { —jx(el2*F-I1))@i2(F—fa) ee ey 


= 2F- Lf x i2 20) sin(2 m fin) (3.41) 
= 2a[n + ii Lal (3.42) 


Therefore, the answer is x[n] cos(2af,n) + 2 x[n + 1] sin(2af;n). 


The DTFT provides a sound theoretical analysis of discrete-time signals. Unfortunately, it is 
difficult to employ in real-time systems because of the continuous frequency variable. An alternative 
is the discrete Fourier transform (DFT) and its easy-to-implement friend the fast Fourier transform 
(FFT) when the signal length is a power of 2 (and in some other cases as well). Of all the transforms, 
the DFT is the most useful from a wireless signal processing perspective. It is critical in enabling the 
low-complexity equalization strategies of SC-FDE and OFDM, which are discussed further in 
Chapter 5. 

The DFT applies only to finite-/ength signals; the DTFT works for both finite and infinite-length 
signals. The DFT analysis and synthesis equations are 


N-1 

Analysis x [k] = >. a|njei kn R= 6, 1M 1 (3.43) 
n=0 
1 N-1 

Synthesis x[n| = W >, x[kldv*" n=0,1,...,N—-1. (3.44) 
*" k=0 


Note the presence of discrete time and discrete frequency. The DFT equations are often written in an 
alternative form using Wy = exp(—j27/N). 


A list of relevant properties is provided in Table 3.5. There are some subtle differences relative to 
the DTFT. In particular, shifts are all given in terms of a modulo by N operation denoted as (-),y. This 
ensures that the argument falls in 0, 1,..., N— 1, for example, (5), = 1 and (-1)4 = 3. Figure 3.2 
illustrates how to compute the circular shift x[((n + 2))<] of a finite-length signal x[n] when N = 5. 


The circular shift modulo N can be computed from the linearly shifted version of x[n] in Figure 3.2(a) 
by moving the two samples located at negative indices in the linearly shifted signal to the other end of 
the signal, as illustrated in Figure 3.2(b). It can also be computed by creating an N-periodic signal 


r{[n] from the original finite-length signal and extracting the first NV samples of z[n — 2], as shown in 
Figure 3.2(c). The result of the circular shift x[((” + 2))s5] is shown in Figure 3.2(d). 


x|n| x [k] 

x 1|n],x2[n] x 1[k], x2[k] 
ax,|n] + bra[n| ax 4[(k] + bx [k] 
x [n Nza|((—k))n] 
x{((n — m))y] we x(k 
Weta x{(k-9)n] 


5 x1|m]xo[((n — m))n]| x 1[k] x 2[k] 


#1 (n}ao[n| 4 > x [4 xo[((k — 2))n] 
z*|n] 

x*|((—n))n] 

Re{z[n]} 

jlm{a|n] I 

Lep|k] = 2 5 {a[n] + x*[((—n)) n]} 
®oplk] = 3 {x[n] — x*[((—n))]} 


t+ a(n] i 


hole bol 
Se 


x [k] = x*[((- 
Re {x [k]} = et eitt 
Symmetry properties Im { x{k]} = —Im {x 
|x [k]| = |x K(— ‘JN 
4£x [k] = —4£x|((- 
Lep[k] = § {z[n] + 2[((—n)) w]} Re{ x [k]} 
Lop|k] = 5 {x[n] — 2[((—n))n]} jlm{ x [k] } 


Table 3.5 Discrete Fourier Transform Relationships 


a(n | 2) 


x|((n + 2))5] 


. ee 
(a) (b) (c) (d) 
Figure 3.2 (a) Finite-length signal. (b) The circular shift can be computed from the linear shift of 
x[n]. (c) The circular shift can also be computed from a periodic version of the finite-length signal. 
(d) Circularly shifted version of x[n] for N= 5 


The modulo shift shows up in the shift property as well as in the duality between multiplication in 
the frequency domain and convolution in the time domain. In particular, a new kind of convolution is 
used called the circular convolution, denoted by ~@, which uses shifts of modulo N to compute the 
convolution sum: 


N-1 


x(n] ® xa[n] = > x[m|zra[((n — m))y]. (3.45) 


m=0 


The DFT of the circular convolution of two finite-length signals is the product of the DFTs of both 
signals, as shown in Table 3.5. The linear and the circular convolutions are different operations, but 
the DFT can be used to efficiently compute linear convolutions because in some cases both 
operations are equivalent. Thus, ifx,[”] of size N, and x,[n] of size N, are padded with zeros to 
length at least NV, + N,— 1, the linear convolution of x,[”] and x,[n] can be computed as the inverse 
DFT of the product x,[k]x,[k]. For more details on the DFT and the circular convolution see [248]. 
The circular convolution plays a key role in the development of SC-FDE and OFDM, as explained in 
Chapter 5. 

Now we provide several examples to illustrate computations with the DFT. Example 3.8 shows a 
basic DFT calculation. Example 3.9 gives an example of circular convolution. Example 3.10 shows 
how to perform the DFT by creating the input signal as a vector and defining a DFT matrix. Example 
3.11 through Example 3.14 provide example calculations of the DFT. 


Example 3.8 
Calculate the DFT of {1, -1, 1}. 


Answer: 
2 aoe 2 
X [0] = > a[nje—i27n0/3 — y. z[n] =1+-—-1+1=1 (3.46) 
n=0 n=0 
. 
A(f),= 2 z[nje~J27"1/3 pierre a. gees 4 errr S i + jv3 (3.47) 
n=0 
9 
X([2] = > a[nje~i2"2/3 — e7i2-0-2/3  _ p—j2m-1-2/3 4 o—j2n-2-2/3 _ 4 _ jv3— (3.48) 
n=0 
Example 3.9 
Calculate the circular convolution between the sequences {1, —1, 2} and {0, 1, 2}. 
Answer: 
2 
y[n| = S> a[Jh{(n — £)3] (3.49) 
é=0 
y|0| = S > x[Aa{(o — £)3] = x[0]A[0] + x[1)h[2] + x[2]h[1] = 0 (3.50) 
y({1] = S > [Jala — €)3] = x[OJA[1] + x[1]h[0] + 2[2}h[2] = 5 (3.51) 
y[2] = S> a[Jri(2 — £)3] = x[0)h[2] + x[1]h[1] + x[2]h[0] = 1 (3.52) 


y(n] = {0, 5, 1}. (3.53) 


Example 3.10 


It is useful to write the DFT analysis in matrix form for a length NV sequence. In this example, write 
this for general N, then compute the DFT matrix (which transforms the time-domain vector to the 
frequency-domain vector) explicitly for NV = 4. 


Answer: In the general case, 


x [0 1 1 ach 1 x0] 
x [1] 1 eit oa CURD [1] 
7 (3.54) 
x[N — 1] 1 eSRN-D | el (N-1)" z|N — 1] 
For N=4, e~iz = —j, so the DFT matrix D is 
1 1 l l l 1 it 1 
= _3)\2 _ 7\3 _ _ 
Lr ea er ip L =L 4, =i 
1 (-})? (-j)° (-j)? 1 j -1 -j 


Example 3.11 
Find DFT {DFT {DFT {DFT {x[n]}}}}. 
Answer: Use the reciprocity property of the DFT: x[] << X[k] @ X[n] — Nx[(-k),]. Therefore, 


DFT {2z[n]} = X[k] (3.56) 

DFT{DFT{z[n]}} = DFT{X[k]} = Na[(—k)n]| (3.57) 
DFT{DFT{DFT{z[n]}}} = DFT{N2[(—k)n]} = NX|(-k)y] (3.58) 
DFT{DFT{DFT{DFT{z[n]}}}} = DFT{NX[(—k)n]} (3.59) 
= N*z\k] (3.60) 


Example 3.12 
Assuming that Vis even, compute the DFT of 


1 neven 
tin| = 3.61 
in] h n odd. ( ) 


Answer: 
N/2-1 
Kk] = >. e J 2kn (3.62) 
n=0 


Ifk = 0 or k= N/2, then ,-—j3f2kn _ 1, sox k= 0] =x [k= N/2| = x. Ifk #0 and k # N/2, 


N/2-1 ee 
—j2%2kn l —= ¢€ j2mk . 9 fe 
) e view” = 7 _eek -— 0. (3.63) 
n=0 


Therefore, 


“i= > (6 + 6[k — N/2]). (3.64) 


Example 3.13 
Assuming that Vis even, compute the DFT of 


z(n| = a ns N/ A (3.65) 
0 N/2<n<N-1. 


Answer: From the definition, 
N/2-1 


eS Se ede (3.66) 


n=0 


It should be clear that x {k; = 0] = ‘. Fork #0, 


1 — enike eS (iF — ei) _— 
x [k] = er a es (3.67) 
—e oN e JN (el NW — eS N ) 
SOY oF yt 
Py ee , ont = ‘ 
= e JN (N/2-1)— ez) (3.68) 
sin( =) 
For & even gin( £2) = 0, and for k odd sin( £2) = Lei = (j)*-! = (—1)(*-))/2, thus 
A 7) = 4 j 
and r= 
( {k—1)/2 — 
x [k] = ¢ eS WN/2-1) Lene -; k odd (3.69) 
0) k even. 


Example 3.14 
Compute the DFT of the length NV sequence x[n] = a” using direct application of the analysis equation. 
Answer: Inserting into the analysis equation: 


x [k] 


s a" el iF kn (3.70) 


n=0 


1-—aN 


OT co ej ka ; 


Using the Fourier transform, we are now equipped to define a notion of bandwidth for signals. 


3.1.4 Bandwidth of a Signal 


The continuous-time signals transmitted in a wireless communication system are ideally bandlimited. 
This results because of the desire to achieve bandwidth efficiency (send as much data for as little 
bandwidth as possible) and also the way governments allocate spectrum. The classic definition of the 
bandwidth of a signal x(¢) is the portion of the frequency spectrum <(/) for which x(f) is nonzero. 
This is known as the absolute bandwidth. Unfortunately, signals in engineering settings are finite in 
duration, and thus with this proper definition their bandwidth is infinite per Example 3.15. 
Consequently, other definitions are used to define an operational notion of bandwidth (an 
approximation of the bandwidth of a signal that is treated as if it were the absolute bandwidth). 


Example 3.15 
Show that finite-duration signals are in general not bandlimited. 


Answer: Let x(t) denote a signal. We take a snapshot of this signal from [—7/2, 7/2] by windowing 
with a rectangle function to form x7 (t) = rect(t/T)x(t). The term windowing reminds us that we are 


“looking” at only a piece of the infinite-duration signal. From the multiplication property in Table 3.1: 
Kol Ff) — CF) ae Pane (FT): (3.72) 


Since sinc(f7) 1s an infinite-duration function, the convolution is also an infinite-duration function for 
general x(f). 


The bandwidth of a signal is defined based on the energy spectral density (ESD) or power spectral 
density (PSD). For nonrandom signals, which are not periodic, the ESD is simply the magnitude 
squared of the Fourier transform of the signal (assuming it exists). For example, the ESD of x(¢) is 
given as |x(f)/? and the ESD of x[7] is |x(e?". For a periodic signal, with period 7, o, the PSD is 
defined as 


DO 
So(f)= D> leln|P6(f — nfo), (3.73) 
m=—=—Co 
where x[n] are the CTFS of x(¢), and fy = 7 For a WSS (wide-sense stationary) random process, 


discussed in Section 3.2, the PSD comes from the Fourier transform of the autocorrelation function. 
We now define ways of calculating bandwidth assuming that a PSD is given (similar calculations 
apply for an ESD). 


Half-Power Bandwidth The half-power bandwidth, or 3dB bandwidth, is defined as the 
value of the frequency over which the power spectrum 1s at least 50% of its maximum 
value, that 1s, 


_ p(h) () Sad 
B= fap — Szap: (3.74) 


where pon and fe, A. are as shown in Figure 3.3. 


1% of totalarea 


Smex 


sn ee See 
0 f 
0 
(b) 
Figure 3.3 The different ways of defining bandwidth for (a) baseband signals and (b) passband 
signals 


XdB Bandwidth The XdB bandwidth is defined as the difference between the largest 
frequency that suffers XdB of attenuation and the smallest frequency. Similar to the half- 
power bandwidth, this can be expressed as 


(h) (é) ae: 
B= fxap — fxap: (3.75) 


where fw and fo are defined in the same manner as fin) and fo ; 
XdB XdB /3dB /3dB 


Noise Equivalent Bandwidth The noise equivalent bandwidth is defined as 


B- / P,(f)df. (3.76) 


Pr(fe) 


Fractional Power Containment Bandwidth One of the most useful notions of bandwidth, 
often employed by the FCC, is the fractional power containment bandwidth. The 
fractional containment bandwidth is 


B/2 


B/2 Tes 
/ P,(f)df =a | P,(f)df, (3.77) 
J0 


J —OO 


where @ is the fraction of containment. For example, if a = 0.99, then the bandwidth B 
would be defined such that the signal has a% of its bandwidth. 


The bandwidth depends on whether the signal is baseband or passband. Baseband signals have 
energy that is nonzero for frequencies near the origin. They are transmitted at DC, and bandwidth is 
measured based on the single-sided spectrum of the signal. Passband signals have energy that is 
nonzero for a frequency band concentrated about f= +f, where f, is the carrier frequency and f, -s> 0. 


Passband signals have a spectrum that is centered at some carrier frequency, with a mirror at —f¢ 


because the signal is real, and do not have frequencies that extend to DC. It is common to use a notion 
of bandwidth for passband signals, which is measured about the carrier frequency as illustrated in 
Figure 3.3. The aforementioned definitions of bandwidth all extend in a natural way to the passband 
case. Passband and baseband signals are related through upconversion and downconversion, as 
discussed in Sections 3.3.1 and 3.3.2. 


Example 3.16 


Consider a simplified PSD as shown in Figure 3.4, and compute the half power bandwidth, the noise 
equivalent bandwidth, and the fractional power containment bandwidth with a = 0.9. 


P(f) 


fe-2 fe-l fe fet fet2 — f [MHz] 
Figure 3.4 An example PSD for purposes of the bandwidth calculations in Example 3.16 


Answer: From Figure 3.4, we have fc. — f, — 1.5MHz, and fun = f, + 1.5MHz; thus 


Bsap = (fe + 1.5) — (fe — 1.5) = 3MHz. (3.78) 
We have P,(f,) = 1, and the integral can be computed as the area of the trapezoid 
1 


Baoise equivalent — F,7;. | P )d, = 3MHz: ‘3.09 
mea A me ey 


The total area of the PSD 1s 3, so for ao = 0.9, the area that should be left out is 0.3 on both sides or 
0.15 for one side of f,. This corresponds to the point f, — 2 + ,/0.3 on the left. Therefore, 


Brractional = (fe + 2 — V0.3) — (fe — 2 + V0.3) = 2.90MHz. (3.80) 


3.1.5 Sampling 


The fundamental connection between digital communication and DSP is through the sampling theorem. 
This results from the following fact: communication systems are bandlimited. Even ultra-wideband 
systems are bandlimited, just with a very large bandwidth. The bandlimited property means that the 
transmitted signal is bandlimited. Therefore, it is sufficient for the transmitter to generate a 
bandlimited signal. Because the channel is assumed to be LTT, and LTI systems do not expand the 
bandwidth of the input signals, the received signal can also be treated as bandlimited. This means that 
the receiver has to process only the signals found in the bandwidth of interest. 

As a consequence of the bandlimited property, it is possible to exploit the connection between 
bandlimited signals and their periodically sampled counterparts established by Nyquist’s theorem. 
The essence of this is described in the following (paraphrased from [248]): 


Nyquist Sampling Theorem Let x(7) be a bandlimited signal, which means that X(f) = 0 for f= fy. 


Then x(¢) 1s uniquely determined by its samples {x[n} = a(n'T’)}°c__.. ifthe sampling frequency 
satisfies 
1 ” 
fs = 7? 2fN, (3.81) 


where fx is the Nyquist frequency and 2f, is generally known as the Nyquist rate. Furthermore, the 
signal can be reconstructed from its samples through the reconstruction equation 
a 


t—ni 
olf > r[njsine ( = ) : (3.82) 


n=—0o 


Nyquist’s theorem says that a bandlimited signal can be represented with no loss through its samples, 
provided the sampling period T is chosen to be sufficiently small. Additionally, Nyquist shows how 
to properly reconstruct x(t) from its sampled representation. 
An illustration of sampling is provided in Figure 3.5. The values of the continuous signal x(t) taken 
at integer multiples of 7 are used to create the discrete-time signal x[7]. 
X-(t) 


(b) 
Figure 3.5 Sampling in the time domain. The periodic samples of x(t) in (a) form the values for the 
discrete-time signal x[] in (b). 


Throughout this book, we use the block diagram notation found in Figure 3.6 to represent the C/D 
(continuous-to-discrete) converter and in Figure 3.7 to represent the D/C (discrete-to-continuous) 


converter. The sampling and reconstruction processes are assumed to be ideal. In practice, the C/D 
would be performed using an ADC and the D/C would be performed using a DAC. ADC and DAC 
circuits have practical performance limitations, and they introduce additional distortion in the signal; 
see [271] for further details. 


y(t} —>| C/D yln] = ye(nT) 


‘i 


Figure 3.6 Sampling using a continuous-to-discrete-time converter 


x{nj——e| D/C x(t) = > x(n] sine (45"") 


n 


a 


Figure 3.7 Generation of a bandlimited signal using a discrete-to-continuous converter 


Through sampling, the Fourier transforms of x[”]| and x(t) are related through 
oO 


rr 1 l : 
x (e!?7F) = 7 > x (4 - *) (3.83) 


n=— oo 


While this expression might look odd, recall that the discrete-time Fourier transform is periodic inf 
with a period of 1. The summation preserves the periodicity with the result that there are multiple 
periodic replicas. 


If care is not taken in the sampling operation, then the resulting received signal will be distorted. 
For example, suppose that x(t) is bandlimited with maximum frequency fy. Then <(f) = 0 for |f > f/x. 


To satisfy Nyquist, 7 < 1/2f,. For f © [-1/2, 1/2) it follows that //T © [-1/2T, 1/27). Since fy < 


1/2T, then the sampled signal is fully contained in one period of the DTFT and there is no overlap 
among replicas in (3.83). Zooming in on just one period, 


x (el?*F) = — x (# for f € [—1/2,1/2). (3.84) 


Consequently, sampling of bandlimited signals by a suitable choice of 7 completely preserves the 
spectrum. Ifa small-enough 71s not chosen, then according to (3.83) the summations will result in 
aliasing and (3.84) will not hold. An illustration of sampling in the frequency domain is provided in 
Figure 3.8. Examples with sampling are provided in Example 3.17 and Example 3.18. For more 
details refer to [248, Chapter 4]. 


(C) 


Figure 3.8 Sampling in the frequency domain. (a) Magnitude spectrum of a continuous-time signal. 
(b) Corresponding magnitude spectrum for a discrete-time signal assuming that the Nyquist 
criterion is satisfied. (c) Corresponding magnitude spectrum for a discrete-time signal assuming 
that the Nyquist criterion 1s not satisfied. In this example fj > /\ and f, < 2/\. 


Example 3.17 
¢ Suppose that we wish to create the bandlimited signal 


a(t) = 3cos(2710°t + 7/4) + cos(6710°%t + 7/8). (3.85) 


Find the Nyquist frequency and Nyquist rate of x(‘). 
Answer: The Nyquist frequency of x(t) is fy = 3MHz because that is the largest frequency in 
x(t). The Nyquist rate of x(t) is twice the Nyquist frequency; thus 2A, = 6MHz. 


¢ Suppose that we want to generate x(t) using a discrete-to-continuous converter operating at five 
times the Nyquist rate. What function x[n] do you need to input into the discrete-to-continuous 
converter to generate x(t)? 


Answer: In this case T= 1/(5x3MHz) = (1/15)x10~s. Using the reconstruction formula in 
(3.82) and substituting, the required input is 


a(n] = 2(n(1/15) x 107°) (3.86) 
3cos(mn/15+7/4) + cos(an/5 + 7/8). (3.87) 


¢ Suppose that we sample x(t) with a sampling frequency of 4MHz. What is the largest frequency 
of the discrete-time signal x[n] = x(n7)? 


Answer: In this case T= 1/(4MHz) = 0.25 x 10~s. Substituting, 


a(n] = 2(n0.25 x 10~®) (3.88) 
= 3cos(27n/4+ 7/4) + cos(27n3/4 + 7/8). (3.89) 


The largest frequency in [—1/2, 1/2) is 1/4. 


Example 3.18 
Consider the signal 


a(t) = 40?sinc?(40zt). (3.90) 


Suppose that x(¢) is sampled with frequency f, = 60Hz to produce the discrete-time sequence a[n] = 


x(nT) with Fourier transform A(e?). Find the maximum value of fo such that 


on 1 , 
A(el**!) = 7% (+) for |f| < fo, (3.91) 
where x(f) is the Fourier transform of _ 
Answer: The sampling period 7’ = ;-- From Table 3.2 the Fourier transform of x(t) 1s 
er l f —_— 
= eT 3.92 
x) = 7 @ Lae 


The maximum frequency, that is, the Nyquist frequency, of x(t) 1s fy = 40Hz, and thus the Nyquist rate 
is f, = 80Hz, which is larger than the sampling rate of 60Hz. Consequently, there will be aliasing in 
the discrete-time frequency range of [—1/2, 1/2). 

The sampled signal a[n] has transform 


A(e!2*f) = 7 x (Z _ =) (3.93) 
k=—0o 

= 60 5? 2 (60f — 60k). (3.94) 
k=—0c0 


To solve this problem, notice that there 1s overlap only between adjacent replicas, because fy < 1/T < 
f,. This means that inside the range f © [—1/2, 1/2), 


A(e!?*S) = 60 x(60f + 60) + 60 x(60f) + 60 x(60f — 60). (3.95) 


Looking at positive frequencies, the replica X (60f— 60) extends from f= (f,fy)/f, = (60 — 40)/60 = 
1/3 to (f,/2)/f, = 1/2. Thus fp = 1/3 and 


A(e!?*Ff) = 60 x(60f) for |f| < 1/3. (3.96) 


To aro the sampling operation, it often helps to examine the proxy signal 
z(t) =< (t) a [11 (¢/7'), where u(t) =), 0(¢ — k) is the comb function with unit period. The unit 
period comb Aton satisfies m(f) = F'{1(t)}, so the same function is used in both time and 
frequency domains. Example 3.19 shows how the proxy signal can be used to provide intuition about 
aliasing in the sampling process. 


Example 3.19 
Consider a continuous-time signal x(t) with transform x(f) as indicated in Figure 3.9. The transform is 
real to make illustration easier. 


¢ What is the largest value of 7 that can be used to sample the signal and still achieve perfect 
reconstruction? 


Answer: From Figure 3.9, the highest frequency is 1000Hz; therefore 7 = 1/2000. 
xf) 


500Hz l000Hz sf 
Figure 3.9 The Fourier transform of a signal. The transform is real. See Example 3.19. 


¢ Suppose we sample with 7’ = 1/3000. Illustrate x(f) and x,(f) in the interval —1/27 to 1/27 and 
x(e?") in the interval —1/2 to 1/2. 
Answer: The transform is computed as 


and is plotted in Figure 3.10. 


—SWHz S00Hz lWwOHz ff 


1(2T) = 1S00Hz 


—1500Hz —1000Hz 1O00Hz 2000Hz 3000Hz 


~1/2 ~1/3 


Figure 3.10 Connecting x(f), x,(f), and x(e?™) for T= 1/3000 in Example 3.19 


* Suppose we sample with 7 = 1/1500. Illustrate x(/) and x,(f) in the interval —1/2T to 1/2T and 
x(e?%) in the interval -1/2 to 1/2. 
Answer: The transform is plotted in Figure 3.11. 
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Figure 3.11 Connecting (/), x,(f), and x e?"/) for T= 1/1500 in Example 3.19 
¢ Write an equation for x(f). 


Answer: From Figure 3.9, x(f) is a superposition of a rectangular pulse and a triangular pulse. 


Based on the figure, 
f f — 500 
— act | —— —_—_— } |, 3.98 
x(f) = 10 rect € +A =00 (3.98) 


* Determine x(t) given <(f) using the Fourier transform table and properties. 
Answer: 


a(t) = 10 [1000 sinc(1000t) + 500 sinc* (5008) e779") 


(3.99) 
¢ Determine x[n] for T= 1/3000. 
Answer: Substituting 7 = 1/3000 into x[n] =x(nT), we have 
x(n] = 5000 2 sinc (=) + sine? (=) elm] | (3.100) 
d ) 


¢ Determine x[] for T= 1/1500. 
Answer: Substituting 7 = 1/1500 into x[n] =x(1T), we have 


- : 2n 9 f PY) aE ae 
z|n| = 5000 2 sine (F) + sinc* (=) e's | (3.101) 


3.1.6 Discrete-Time Processing of Bandlimited Continuous-Time Signals 


Digital signal processing is the workhorse of digital communication systems as explained in Chapter 
1. This section presents an important result on filtering continuous-time signals using a discrete-time 
system. This observation is exploited in Section 3.3.5 to develop a fully discrete-time signal model 
for a communication system. The steps of the derivation are summarized in Figure 3.12. 


(a) 


x[n] ——> ital yin] 


(c) 

Figure 3.12 (a) A continuous-time LTI system with a bandlimited input; (b) generating a 
bandlimited input using a discrete-to-continuous converter, followed by processing with an LTT 
system, followed by sampling with a continuous-to-discrete converter; (c) the discrete-time 
equivalent system 


We start by explaining the implications of processing a bandlimited signal by an LTT system. Let 
x(t) be a bandlimited signal input into an LTI system with impulse response /,(t), which is not 
necessarily bandlimited, and let the output of the system be y(t). The input, output, and system are in 
general complex because of complex envelope notation (see Sections 3.3.1 and 3.3.2) and the concept 
of the baseband equivalent channel (see Section 3.3.3). The fact that x(t) is bandlimited has three 
important implications: 


1. The signal x(t) can be generated in discrete time using a discrete-to-continuous converter 
operating at an appropriate sampling frequency. 

2. The signal y(t) is also bandlimited since x(f) is bandlimited and y(/) = h,(/)x(/). Therefore, 
y(t) can be processed in discrete time using a continuous-to-discrete converter operating at an 
appropriate sampling frequency. 

3. Only the part of the channel h,(/) that lives in the bandwidth of <(f) is important to y(/). 


These facts can be used with additional results on sampling to obtain a relationship between the 


sampled input, the output, and a sampled filtered version of the system. 
Suppose that x(t) has a bandwidth of fy = B/2. For this derivation, absolute bandwidth is assumed. 


The sample period Tis chosen to satisfy 7 < 1/B. Define the portion of the channel that lives in the 
bandwidth of x(t) as 


hiow(f) = rect(f/B)h-(f), (3.102) 
where rect(//B) is an ideal lowpass filter with bandwidth of B/2. Using Table 3.2, in the time domain, 
Right) = Bf sinc(7B)h,(t — r)dr. (3.103) 


With these definitions, because x(t) and y(t) are bandlimited with bandwidth B/2, 
y(f) = hiow(f)x(f). (3.104) 


The filtered impulse response /,,,(¢) contains only the part of the system that is relevant to x(t) and 
y(t). 
Now consider the spectrum of y[7]. Referring to (3.84), 


y(e!?"F) = 7(4) for f € [—1/2,1/2) (3.105) 
l 
— hom & (7) for f € [—1/2,1/2). (3.106) 
Substituting for x(e?”/) from (3.84), 
y(el?"F) = how (4) x(el?"F) for f € [—1/2,1/2) (3.107) 
- 1 Thy J x (e)?"F) for f € [—1/2,1/2). (3.108) 
1 ie ow T 5 — ’ . 
Now suppose that 
h(f) = Thiow(f). (3.109) 
Then 
joan f 1 f Jen f : P ‘ ‘ ‘ 
y(e"™7) = —h{ — | x(e*’™) ~— for _f € [—1/2,1/2). (3.110) 
xy he 
Again using (3.84), 
y(e!?"4) = h(e?"F) x(el?*F) for f € [—-1/2,1/2). (3.111) 


Therefore, it follows that the discrete-time system that generates y[n] from x[7] is 


h{n| = Thiow(nT) (3.112) 
=i / sinc(TB)h.(nT' — 7)dr (3.113) 
— | sinc(TB)h.(nT — r)dr. (3.114) 


The discrete-time equivalent h[n] is a smoothed and sampled version of the original impulse 
response A(t). We refer to h[n] as the discrete-time equivalent system. An example calculation of the 
discrete-time equivalent of an LTI system that delays as input is provided in Example 3.20. 


Example 3.20 

Suppose that an LTT system delays the input by an amount t,. Determine the impulse response of this 

system and its discrete-time equivalent assuming that the input signal has a bandwidth of B/2. 
Answer: This system corresponds to a delay; therefore, h,(t) = 6(¢ — tz). Applying (3.114), the 

discrete-time equivalent system is 


h|n| = [ sinc(rByhe(n — T)dr (3.115) 
= / sinc(7B)é(nT — tq — T)dt (3.116) 
= sinc(nBT — Brg) (3.117) 
= sinc(n — Bry). (3.118) 


It may seem surprising that this is not a delta function, but recall that t, can take any value; if T, is an 
integer fraction of 7, then Br, will be an integer and h[n] will become a Dirac delta function. 


3.2 Statistical Signal Processing 


Randomness is a component of every communication system. The transmitted information i[7], the 
effects of the propagation channel A(t), and the noise n(t) may all be modeled as random signals. For 
purposes of system design and analysis it is useful to mathematically characterize the randomness in a 
signal using tools from probability theory. While this is a deep area, the class of random signals 
required for wireless communication is sufficiently basic and is summarized in this section, with an 
emphasis on application of probability tools, leaving the appropriate proofs to the references. 


3.2.1 Some Concepts from Probability 

Consider an experiment with outcomes that are generated from the sample space S= {@, @, .. .} 
whose cardinality is finite or infinite according to some probability distribution P (wi), To each 
outcome w, from this space is associated a real number x(@), which is called a random variable. A 
random variable can be continuous, discrete, or mixed in nature and is completely characterized by 
its cumulative distribution function (CDF). For continuous-valued random variables let us use the 
notation F..(a@) = : ome f,(z)dz to denote the cumulative distribution function where the probability 
that x is less than or equal to a is P(t < a) = Fr(a ) and f(z) denotes the probability distribution 
function (PDF) of x. The PDF should satisfy f(z) = 0 and I, f.dz = 1. For discrete-valued random 
variables, let y denote the ordered set of outcomes of the random variable indexed from — to 0, and 
let p,[m] denote the probability mass function (PMF) of x where the index m corresponds to the index 
of the outcome in y. Then p,[m] is the probability that the value with the m" index in y is chosen. Note 
that p,[m] = 0 and >'m p,[m] = 1. It is possible to define joint distributions of random variables in 
both the continuous and discrete cases. 


The expectation operator Ex || js used in various characterizations of a random variable. In its 
most general form, for a continuous-valued random variable and given function g(x), 


E, {g(x)| = / 9(z) fa(z)dz. (3.119) 


The expectation operator satisfies linearity; for example, Ey (g(x) + h(x)] = E,[9(x)] + Ez[h(x)| 
. The mean of a random variable 1s 


me = E,(z| (3.120) 
= a pha (3.121) 
The variance of a random variable is 
o- =E [|x — Mr | (3.122) 
=E ||a 7] —|m,|? (3.123) 


The variance is nonnegative. 
Random variables x and y are said to be independent if F,, ,(a, B) = F.(a)F,(B). Essentially the 
outcome of x does not depend on the outcome of y. Random variables x and y are said to be 


uncorrelated if 2x,y|24) = 9. Be careful: being uncorrelated does not imply independence except in 
certain cases, for example, ifx and y are Gaussian. 


Example 3.21 
Suppose that f(x) = 1/2 for x © [0, 2) and is zero elsewhere. Compute the mean and variance of x. 


Answer: The mean is 


E,[z] = | zf,(z)dz (3.124) 
( 
. 
= 0.52dz (3.125) 
wv 0 
= 0.252°|¢ (3.126) 
= 0.25(4 — 0) (3.127 
=] (3.128) 
Now computing the variance: 
o2 =E., [|2|7] — |mz|* (3.129) 
9 
= | z° fx(z)dz —1° (3.130) 
JO 
“12 wer 
= ~z°dz—1 (3.131) 
2 
Jo 4 
Eee Pe seas 
= =25/% —1 (3.132) 
6 
er 
= g (8 —Q)-1 (3.133) 
j 
= 8/6 — 6/6 (3.134) 
= 1/3. (3.135) 


3.2.2 Random Processes 


A random or stochastic process is a mathematical concept that is used to model a signal, sequence, or 
function with random behavior. In brief, a random process 1s a probabilistic model for a random 
signal. It is a characterization of an ensemble of random signals, not just a single realization. Random 
processes are used in diverse areas of science, including non-wireless applications such as video 
denoising and stock market prediction. This book primarily considers discrete-time random 
processes, though it does require description of the continuous-time case as well for the purpose of 
defining the bandwidth of the transmitted signal. 


Now we extend the definition of random variable to a process. Suppose that each outcome is 
associated with an entire sequence {x [n, wi]} ea g according to some rule. A discrete-time 
random process 1s constituted by the sample space S, the probability distribution Plu; | and the 


sequences { x [n, w;]| , . Each sequence {x [n, Ww; ae an is called a realization or sample path 


n=—oo 


and the set of all possible sequences {7 {n, w,;)}°* 


nRi=—ooO 


description of a random process 1s given in Figure 3.13. 


, @; © Sis called an ensemble. The ensemble 
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Figure 3.13 Ensemble description of a random process in discrete time. Each outcome 
corresponds to the realization of a signal. 


Random processes can also be visualized as collections of random variables at different points in 
time. We say that x [n, m] is a random process if for a fixed value ofn, say 7», = 7, x [n, w] isa 
random variable. Having understood the connection with the sample space S, we eliminate the 
explicit dependence on w from the notation and pursue the notion that a discrete-time random process 
is essentially a collection of indexed random variables where n is the index and x[7] is the value of 
the random variable at index n. 

To simplify the exposition, suppose that the random process x[] is continuous valued. In this way, 
we suppose that x[] has a well-defined PDF. 


Random processes are fully characterized by knowledge of their joint probability distributions for 
all orders. When only some joint distributions are known, this leads to a partial characterization. For 
example, the first-order characterization of x[7] is knowing the CDF F,,,,)(a), or equivalently the PDF 


fin), for all n. The second-order characterization of a discrete-time random process is knowing the 
joint distribution between x[n,] and x[/] for any choice of mj and ny: Fyn x{n,{(O1> 2). Of course, 
the second-order characterization includes the first-order characterization as a special case. Higher- 
order characterizations can be defined in a similar fashion. 


An important class of random processes consists of those that are independent identically 
distributed (IID). Such processes satisfy two important properties. First, all of the collections of 


random variables (x[”,], x[”>], ...,x[n;,]) for k > 1 are independent. This means that for any m < k, 
Fy {ny),2[n2] eae [rm] (1 »AQ,---, Om) = Fyiny] (a1) Feng] (a) Sore Fy inin] (Qn) : (3.136) 


Second, the random variables are identically distributed, thus 
Fyiny] (Ok) = Fr (Ox) (3.137) 
for all x[n,], x[m>],...,x[n,], and 
Fryfny),a[na],---5¢[m](A1; @2) ++; Qm) = Fy (a1) Fr (@2)-++ Fz (Qm)- (3.138) 


We often model the transmitted data and additive noise as IID. 


Example 3.22 
Find a random process that models an independent sequence of equally likely bits. 

Answer: Let i[n] denote the sequence of bits. Because the process is independent, each i[7] is 
generated independently. As the distribution is the same for each bit, it is also identically distributed. 
Let i[n] be a discrete random variable with two possible outcomes: Plt = 0) = 1/2 ana 
Plt = 1] = 1/2, Denoting the set of outcomes y = {0, 1} with index set {0, 1}, the random process is 
completely characterized by the first-order probability mass function p;[0] = 1/2 and p[1] = 1/2. 


3.2.3 Moments of a Random Process 


The concept of expectation can be extended to provide another characterization of random processes 
based on their moments. The mean of a random process is 


Mz[n] = Ezfny [z[n]] (3.139) 
= [ zfeinj(z)dz. (3.140) 
J —oOo 


The mean is in general a function of 7 since the distribution may vary as a function of n. The 
correlation of a random process is 


Rez[N1,N2] = Eziny),z{n2] [Z{N1]2* [Na] (3.141) 
30 "OO 
= / | 2129 fe{ni),2[n2] (21, z9)dz,dz9. (3.142) 
The covariance of a random process generalizes the concept of variance: 
Crz[N1, M2] = Egin,),x{ng] [e[r1]x* [na] — Meng] ™z[n9]° (3.143) 


If the process is zero mean, C,,[1), 15] = R,,["1, 12]. Most processes in this book are zero mean. The 
covariance function is conjugate symmetric, that is, C,,.,. (m4 ng] =, (no, ny } This is also true of 
the correlation function. The variance is 


a? [n] = Crz|n, n] (3.144) 
and is a measure of the power of a random signal. 
A random process is called uncorrelated if 


o7(n4], on = ny . ee 
Crz[N1,N2] = a : u P = (3.145) 


Note that the uncorrelated term is applied to the covariance function and not (surprisingly) to the 
correlation function. 


A random process is called orthogonal if 


Einjlz{nil]|?, mi =n ee 
Rasim =| a, a bh (3.146) 


An uncorrelated zero-mean process is also an orthogonal process. 
The cross-correlation between two stochastic processes x[n] and y[7] 1s 


Rey|ni,N2] = Ezfny),y{na] [e[ra}y* [na]. (3.147) 


Analogously, the covariance between x[n] and y[] is 


Cry|n1, M2] = Ezin,},y{n2) [((2[21] — Mz[n1])(ylna] — my|[na])*]. (3.148) 


3.2.4 Stationarity 
Stationarity 1s a characterization of a random process. A random process is stationary of order N if 


F s(ni),e[na] ear r[n we] (O15 OQ +++ an) = Fein +k],c[no+k] Lamad t[nn +k] (a1, OlQ,-++. ,an) (3.149) 


for all k. A process that is stationary for all orders N, 1, 2,...is called strict-sense stationary 
(SSS). Perhaps the most important class of random processes that we deal with are WSS random 
processes. 


WSS is a particular type of stationary that is useful for random systems. A WSS random process 
x[n] satisfies these criteria: 


* The mean is constant m,,,,;[n] = m,. 


¢ The covariance (or equivalently the correlation) is only a function of the difference 


has in4, n| — See [ne _ ny]. (3.150) 


No assumptions are made about joint probability distributions associated with a WSS random 
process. By construction, an IID random process is also WSS and any random process that is 
stationary of order N> 2 1s WSS. 


A similar definition of WSS applies for continuous-time random processes. A WSS random 
process x(t) satisfies these criteria: 


¢ The mean is constant: 


mz (t)(t) = / z fr) (z)dz (3.151) 
7 is (3.152) 


¢ The covariance (or equivalently the correlation since the mean is constant) is only a function of 
the difference: 


Con (ti, t2) = Coz (to —t; }. (3.153) 


Estimation of the mean and covariance for continuous-time random processes in practice is 
challenging because a realization of x(t) for all values of ¢ 1s not necessarily available. Consequently, 
x(t) is usually sampled and treated as a discrete-time random process (DTRP) for experimental 
purposes. 


There are several useful properties of autocorrelation function R,.[”] = R,..[n2 — |] WSS random 
processes: 
1. Symmetry: The autocorrelation sequence of a WSS random process x[7] is a conjugate 
symmetric function: 


Rzz|[n] = Rp,[—n]. (3.154) 


When the process is real, the autocorrelation sequence is symmetric: R,..[n] = R,,[—n]. 


2. Maximum Value: The magnitude of the autocorrelation sequence of a WSS random process 
x[n] 1s maximum at n = 0: 


Rex {n| < Rex [0], Vn. (3.155) 


3. Mean Square Value: The autocorrelation sequence of a WSS random process at n = 0 is equal 
to the mean square value of the process: 


Feex(0] = El|x(n]!"l, (3.156) 


We conclude the discussion of WSS with some example calculations involving computing the mean 
and covariance of signals and checking if they are WSS. We present classic examples, including a 
moving average function in Example 3.23, an accumulator in Example 3.24, an autoregressive 
process in Example 3.25, and a harmonic process in Example 3.26. 


Example 3.23 
Suppose that s[] is a real, IID WSS random process with zero mean and covariance 


Balk = o2 6k}. Suppose that x[”] = s[n] + 0.5s[n — 1], whichis an example of what is called a 
moving average process. Compute the mean and covariance of x[n], and determine if itis WSS. 
Answer: The mean is 
E,|z[n]] = E, [s[n] + 0.5s[n — 1]] (S.1E 
= 0 (3.158 
since s[n] is zero mean and the expectation operator is linear. 


The covariance is the same as the correlation function, since the process is zero mean. Computing 
the correlation function, 


Rrx{N1,n2| = Es [z[n1]x*[na]] (3.159) 
= E, |(s[ni] + 0.5s[ny — 1]) (s[ng] + 0.5s[n2 — 1])"] (3.160) 
= KE, [s[n1]s* [ng] + 0.58[ni]s*[n2 — 1] 
+0.5s {my ~IJs{na] + 0.25s[r — I]s"[na ~ 1) (3.161) 
= R,,[n2 — n1] + 0.5R,.[n2 — n1 — 1] 
+ 0.5Rs5[n2 —m + 1] +0.25Rs5[n2 — 1] (3.162) 


= 1.25Rss[n2 — m1] + 0.5Rs5[no —n1 — 1] +0.5Rs.[no2 —n1 +1]. (3.163) 


The correlation is only a function of the difference n, — n,; thus x[n] is WSS and the correlation can 
be simplified by substituting k =n, —n,: 


Rez{k] = 1.25Rs5[k] + 0.5Rs5[k — 1] + 0.5R55[k + 1]. (3.164) 
Substituting the given value of R,.[], 


Rex [k] = 1.25026[k] + 0.5026[k — 1] + 0.5076[k + 1]. (3.165) 


Example 3.24 


Toss a fair coin at each n, —c00 <n < o and let 


bol 


ai= ! +1, if heads is the outcome, P(H (3.166) 


—1, if tails is the outcome, P(7') = 


w[n] is an IID random process. Define a new random process x[n]; 1 = 0 as 


x{0] = w(0] (3.167) 
x{1] = w[0] + w{1] (3.168) 


Th 


z(n|] = » & wii]. (3.169) 


i=0 


This is an example of an accumulator. Determine the mean and covariance of x[n] and determine if it 
is WSS. 


Answer: The mean of this process 1s 


E,[z[n]] = > — Ew[wlil] (3.170) 
i=0 
= 0. (3.171) 


The covariance is the same as the correlation function, since the mean is zero. We can assume ny > 1, 
without loss of generality and compute 


Rzz|N1, N2] = Ey [x[ny]x*[ng]] (3.172) 


(5° wil (3° vt | : (3.173) 
i=0 k=0 


To simplify the computation, we exploit the fact that n, <n, and break up the second summation: 


= E,, 


My Thy Thy 9 
Rez[ni,n2] = >) > Ew [wfijw* [J] + 45 SY) Ew [ewlijw* [Al]. (3.174) 
i=0 k=0 i=0 k=n,+1 


Note that the expectation of the second term is zero since the random process is independent and zero 


mean. To see this, note that for two independent random variables x and y, E,y|ry| = E,|2] E,|y) 
To compute the expectation of the first term, we factor out the square products: 


WY Thy] hy} 
Ryx[n1,N2] = 2, Fe [| ‘wii 17] +- >, 2, Ey [w{i]w*[k]] . (3.175) 
i=0 k=0,i4k 


Notice that again the second term is zero. Because w[n] is IID, 


Rzz[n1, Ng] = (ny + 1)E,, [|wfi]|?] (3.176) 
= (ny, +1) (507 alr 5(-1)*) (3.177) 


= (nm +1). (3.178) 


Since the correlation is not a function of the difference n, — n,, the process is not WSS. This process 
is commonly referred to as a random walk or a discrete-time Wiener process. 


Example 3.25 
Let x[n] be a random process generated by 


z|n] = az[n — 1] + w[nl, (3.179) 


where n > 0, x[-1] = 0, and w[] is an IID (j:,,,, a2) process. We can express the process more 
conveniently as 


x{0| = ax[—1] + w[0] = w/0] (3.180) 
x{1] = ax[0] + w[1] = aw/0] + wl] (3.181) 
a(n] = a" w/[0] + a”! wil] + a2 w[2] +... + w[n] (3.182) 
=) a” ‘ufil. (3.183) 

i=0 


Determine the mean and covariance of x[n] and determine if itis WSS. 
Answer: The mean of this process 1s 


nm 


Ey [2[n]] = > — a” *E,, [w[i]] (3.184) 
i=0 
rh 
= ly y gr (3.185) 
i=0 
= woar S- ay’. (3.186) 
i=0 


For a= 1, Ewle[n]] = Hw(n + 1) whereas a# 1, 

Ew[z{n]] = Hwa" . =a. 7), el = '). Hence, for 1, # 0, the mean depends on n and the 
a —a 

process is not stationary. For t= 0, however, the mean does not depend on n. 


For u,, # 0 and |a| < 1, we observe an interesting phenomenon as n — oo. The asymptotic mean is 


lim E,[x[n]] = = 


n—0o end 


(3.187) 


Random processes with this property are said to be asymptotically stationary of order 1. In fact, one 
can show that the process is asymptotically WSS by studying lim, _,,, Ry,[7, "2] and showing that it 


is a function of only 1, — n). 


Example 3.26 
Consider a harmonic random process x[n] = cos(2mf,n + >) where is a random phase that is 
uniformly distributed on [—a, 2]. This process is corrupted by additive white Gaussian noise 
v[n] ~N ( 0. o?) that is uncorrelated with x[n]. The resulting signal y[n] = x[n] + v[n] 1s available 
to the receiver. Compute the mean, covariance, and power spectral density of y[n]. Is y[n] a WSS 
random process? Suppose that x[n] is the desired signal and v[7] is the noise. Compute the SNR, 
which is the ratio between the variance in x[n] and the variance in v[7]. 

Answer: The mean of x[7] is 


TT 


1 | 
E |cos(27 fon + ¢)| = | ; cos(27 fon + x)dx (3.188) 


—T Zit 


= 0. (3.189) 


The covariance of x[n] 1s the same as the correlation function since x[”] is zero mean: 


Rzz|[n,n + | = E [cos(2a fon + $) cos(2afo(n + £) + )] (3.190) 
= | 

= / cos(2z fon + x) cos(2af,(n + £) + x)dax (3.191) 
J—nT =! 


cos(27 foe) 


§ 
2 


(3.192) 
which is not a function of n and thus can be written R,,.[ 0]. Hence Ris 6] — Fsald] +076 if}. The 
power spectral density of y[n] is simply the DTFT of R,,[€] (discussed in more detail in Section 
32.0): 


9) : l = ” . l = 9 S 
P, (e\2*f) = > Gua fo +m) + SUF + fo +m) ) +07. (3.193) 


m=—co 


Because it is zero mean and the covariance 1s only a function of the time difference, y[”] is a WSS 
random process. 


The SNR is 


——- (3.194) 


The SNR is an important quantity for evaluating the performance of a digital communication system, 
as discussed further in Chapter 4. 


3.2.5 Ergodicity 


One challenge of working with measured signals is that their statistical properties may not be known 
a priori and must be estimated. Assuming a signal is a WSS random process, the key information is 
captured in the mean and the correlation function. Applying a conventional approach from estimation 
theory, the mean and correlation can be estimated by generating many realizations and using ensemble 
averages. Unfortunately, this strategy does not work if only one realization of the signal is available. 


Most WSS random processes are ergodic. To a signal processing engineer, ergodicity means that 
one realization of the random process can be used to generate a reliable estimate of the moments. For 
WSS random processes, the main message 1s that the mean and correlation function can be estimated 
from one realization. With enough samples, the sample average converges to the true average of the 
process, called the ensemble average. 


Ergodicity is a deep area of mathematics and physics, dealing with the long-term behavior of 
signals and systems. Necessary and sufficient conditions for a WSS random process to be mean or 
covariance ergodic are provided in various books—see, for example, [135, Theorems 3.8 and 3.9 ]— 
but discussion of these conditions is beyond the scope of this book. The WSS random processes 
considered here are (unless otherwise specified) assumed to be covariance ergodic, and therefore 
also mean ergodic. 


The mean and covariance of a covariance-ergodic random process can be estimated from the 
sample average from a realization of the process. With enough samples, the sample average 
converges to the true average of the process. For example, given observations x[0], x[1],...,x[N— 
1], then the sample average 


N-1 
S : | 
fie = vim N >. x(n] (3.195) 
n=0 
converges to the ensemble average Elx(n]] and 
_ 1 N-1-k 
Reolk] = lim y= > x{nja*[n + ki] (3.196) 


n=O0 


converges to the ensemble correlation function E [a|nja*[n + kj] Convergence is normally in the 
mean square sense, though other notions of convergence are possible. 

Assuming that a WSS random process is ergodic, the sample estimators in (3.195) and (3.196) can 
be used in place of their ensemble equivalents in subsequent signal processing calculations. 


Example 3.27 

Suppose that x[”] 1s real, independent and identically distributed with mean m,. and covariance 
Cx. [k] — a h) [k}. Show that this process is ergodic in the mean, in the mean square sense. 
Essentially, prove that 


2 
jim E “ (Fe N ? x|n| _ =0) 


: 1 N= 
Answer: Let 77, ,, = K Wie 


5 | n}- First we find the mean of the random variable my, 


E|[my] =E 


= Nw: 


Second, we find the mean of the squared sample average 


E [my | =E 


Then the mean squared error is 


E (mn — mz)? | =E [my — 2mym, + m2 


_ wi ap 


( 
N n=0 


N—1 


>, z|n] 


f—1 N—1 


(x 


) ) H bs (m): r[n] 


m=0 n=0 


N-1 


1 
=a5( DE 


n=0 


a[n]?] 


+ S° E[e| [z[n]a[m]] 


nzm 


—E [z[n]?] — m2 + m2 


2 
+e. 


=— [mx] — 2m,E [myn] + mz 


aS E\z|n] 


n=0 


N-1 


(3.197) 


(3.198) 


(3.199) 


(3.200) 


(3.201) 


(3.202) 


(3.203) 


(3.204) 


(3.205) 


(3.206) 


(3.207) 


(3.208) 


Q 
bh 


- = + m* 2 — Im? +m? (3.209) 


Ss 


a7 


= 2. (3.210) 


ce 


‘= 


As long as a 2 is finite, the limit as N — © 1s zero. Therefore, any IID WSS random process with 
finite variance is ergodic in the mean, in the mean square sense. 


3.2.6 Power Spectrum 


The Fourier transform of a random process is not well defined. When it exists, which it normally 
does not, it is not informative. For WSS random processes, the preferred way to visualize the signal 
in the frequency domain is through the Fourier transform of the autocovariance function. This is called 
the power spectrum or the PSD. Loosely, it is a measure of the distribution of the power of a signal as 
a function of frequency. The power spectrum of a random process is used for all bandwidth 
calculations instead of the usual Fourier transform. 


For continuous-time random processes, the power spectrum is defined using the CTFT as 


Pie | ~ Con (tet, (3.211) 


For discrete-time signals, it is defined in a similar way using the DTFT as 
P, = So Casnjere7" (3.212) 
n=— oo 


In both cases the power spectrum is real and nonnegative. The real property follows from the 
conjugate symmetry of the covariance, for example, C’,,./n] = C*,.{—n|. 


The total power in a random signal is the sum of the power per frequcticy over all frequencies. 
This is just the variance of the process. For continuous-time random processes, 


o? = Czz(0) (3.213) 


- | P,(f)df, (3.214) 
whereas for a discrete-time random process, 
o* ='C,<(0] (3.215) 
1/2 
= | P, (e?"F) oy. (3216) 
A een ke 


Examples of PSD calculations are provided in Example 3.28, Example 3.29, and Example 3.30, using 
direct calculation of the Fourier transform. 


Example 3.28 


Find the power spectrum for a zero-mean WSS random process x(t) with an exponential correlation 
function with parameter B > 0: 


Ryz(t) =e *F!41 st E (—00, 0). (3.217) 


Answer: Because the process is zero mean, computing the power spectrum involves taking the 
Fourier transform of the autocorrelation function. The answer follows from calculus and 
simplification: 


oo 
P,(f) = | Reatye os “dt (3.218) 
FJ — OO 
8OO 
=| e 25ltl e—j2a ft ay (3.219) 
— OO 
0 oO ; 
= | e(2O-J2r ft ge 4 / e~ (28—-j2rf)t ay (3.220) 
=00 0 
1 1 
ee er 3.221 
20—j2nf 26+ j2rf ( ) 
a. 2 (3.222) 
432 + An? f2 
= : (3.223) 


Example 3.29 
Find the power spectrum of a zero-mean WSS process x[n] with C,,[”] = a!", |a|<1. 


Answer: Because the process is zero mean, computing the power spectrum again involves taking 
the Fourier transform of the autocorrelation function. The answer follows by manipulating the 
summations as 


‘> ©) 
}27 Y —j27 ] 990, 
P(e a) = ) Crz[nle x0 (3.224) 
n=—0o 
OO 
= ) ql es2nfn (3.225) 
n=— 00 
= 3 qe l2nfn £5 gis g—lanfn (3.226) 
Nl=— 0O n=1 
OO (> ©) 
=) ater Sate rm (3.227) 
m=0 n=0 
1 i | 
= ——_—_.__. + ——__—____ -_ ] secs 
1 — ael27f 7 1 — ae—i27f ( ) 
1 -—a? 
= ie 3.229) 


~ 1+? — 2acos(2rf)’ 


Example 3.30 
A discrete-time harmonic process is defined by 


M 
eja] = ape ntene? (3.230) 


where {(ax, fir) i . , are given constants and { 4, } jel , are IID uniformly distributed random 


variables on [0, 27]. Using Eel] to denote E {ox}, '-|, the mean of this function is 


M 

Eg[x[n]] = Exjnj | >, aaa (3.231) 
k=1 

= » anEg, [ler Fen+on)) (3.232) 

=u (3.233) 


which follows from 


ar 
- * | 
Ey, (ere to)] — jd RRR dig (3.234) 
0 47 
el(2r fin} Qn 
= e*dz (3.235) 
27 Jo 
=(0, Wk (3.236) 


The autocorrelation function of the signal is given by 


M M i 
Fee] = Ky aa a,erfi - (>: ap,el(2nfe(nt+O)+or } | (3.237) 
i=1 k=1 
M 
k=] 
M M 
at x, >». aja,E¢ eatin )e—I27 fe (n+£)+O% | (3.238) 
i=1 ki 
= >? Jax |2eIP7 FO +4 0 (3.239) 
= 5 lax |2e Pt FeO | (3.240) 
k=1 


Since the autocorrelation is a sum of complex exponentials, the power spectrum is given by 


P(e") = y lax|?5(f — fr): (3.241) 


=1 


When the PSD of an ergodic WSS random process is unknown, it can be estimated from one 
realization. This operation is performed by a spectrum analyzer ina lab. There are several 
approaches to constructing the estimator. One approach is to use some variant of a periodogram. The 
windowed periodogram involves directly estimating the power spectrum P,, (e?"/) from N samples of 
available data {a n]}5- a with window function w[n], =0,..., N— 1 defined as 


N-1 
~ er 1 pa 
P, (e?7/) = = S> w[n|z[nje?7/"| . (3.242) 


n=0 


This method is simply referred to as the periodogram when the windowing function is chosen to be a 
rectangular function. For ease of implementation, the DFT can be used to compute the windowed 
periodogram P, (el27f) at frequencies f,, — & pt =O, N= 


N-1 


~ i , 197 kn | 
P,|k] = WV > w(n|a[nje 2" s |. (3.243) 


n=0 


The resolution of the resulting estimators is inversely proportional to NV. Different window functions 
can be used to achieve different trade-offs in main lobe width (which determines the resolvability of 
close sinusoids) and sidelobe level (spectral masking). 


The periodogram and the windowed periodogram are asymptotically unbiased but are not 
consistent estimators. This means that the variance of the error does not go to zero even with a large 
number of samples. A solution to this problem is to use Bartlett’s method (periodogram averaging) or 
Welch’s method (averaging with overlap). With Bartlett’s method, for example, a measurement is 
broken into several pieces, the periodogram is computed in each piece, and the results are averaged 
together. The resolution of the estimator decreases (since the transform is taken over a fraction of N) 
but the consistency of the estimator is improved. Alternatively, an estimate of the covariance function 
{Crz[(n]} a 1 for L < N samples can be used (possibly with additional windowing) and the Fourier 
transform of those samples taken in what is called the Blackman-Tukey method. All of these methods 
yield consistent estimators with somewhat more complexity than the periodogram. More information 
on spectrum estimation is available in [143, Chapter 8]. 


3.2.7 Filtering Random Signals 


WSS random processes will be used frequently in this book. Because filters are simply LTI systems, 
itis of interest to establish the effects of an LTI system on WSS random inputs. In this section, we 
characterize the mean and covariance of filtered WSS signals. 


First consider an example that requires a brute-force calculation. 


Example 3.31 


Consider a zero-mean random process x[7] with correlation function R,,[”] = (1/2)!"!. Suppose that 
x[n] 1s input into an LTT system with impulse response h[n] = d[n] + 0.5 d[n — 1] — 0.5 d[n — 2]. The 
output is y[n]. Compute the mean and correlation of y[7]. 


Answer: The output of the system is 
y(n] = z[n] * Alin] (3.244) 
= z[n] + 0.52[n — 1] — 0.5a2[n — 2]. (3.245) 
First, we compute the mean of y[7], 
E[y|[n]| = E[a[n]] + 0.5E[a2[n — 1]] — 0.5E[2[n — 2]] (3.246) 
mi (3.247) 


where the first equality follows from the linearity of expectation and the second equality follows from 
the fact that x[7] 1s zero mean. 


Second, we compute the correlation function, 


Ryy|m,m +n] = Ely|[m]y*[m + n]] (3.248) 
= E[(a2[m] + + 5am ~ ' — sem — 2|)(2*[m + n] 


+50 *Im+n—1]- =a *[m +n — 2))] (3.249) 
= E[z[m]z{[m + n]] + sElr >[mja* a +n-1]]- 5B -[mja*[m +n — 2]] 


) E[x[m — 1]a*[m + n — 2]] — sBlelm — 2)x*|m + n]| 
2 


) E{x[m — 2]a2*[m +n —1]] + (5) ; E[z[m — 2]x*[m + n — 2]] (3.250) 


1 1 1 
= 5 Rae[n| + [Resin +1] - 5 Rex[n +2] + qRezln —1)- 5 Rex[n — 2] (3.251) 
3 1 i 1 1 1 
= gol + gmaina + gpoi2 ~ gears ~ gna rr 
Note that for |n| > 2, jv + 1] = |\n|+ 1 and |n + 2|= |n| +2, 
i 1 1 1 —_—_ 
Rigg ta] = In] aa 5+ — 93 T oT sz 33 = =} = 0, (3.253) 
for |n| = 1, 
Le fh | i i 
and 
Ls. 2 1 1 1 ; 
Ry] = +o + -ge- ge = 15 (3.255) 
Therefore, 
15 wn=0 
Ryy[n] = ¢ 0.75 if |n| =1 (3.256) 


0 if |n| > 2. 


We make two observations from Example 3.31. First, the output in this case is WSS. Indeed, this is 
true in general as we will now show. Second, the brute-force calculation of the autocorrelation is 
cumbersome. We can simplify the calculations by connecting the output to the autocorrelation of the 


input and the cross-correlation between the input and the output. 
Suppose that 


y(n] = 5 h[éjz[n — €] (3.257) 
&=—00 


and that x[n] is a WSS random process with mean m,, correlation R,,[k], and covariance C,,[k]. A 


pair of random processes x[n] and y[n] are jointly WSS if each is individually WSS and their cross- 
correlation is only a function of the time difference 


Ryy|n1,n2] = Rzy[n1 + k,n2 + ky, (3.258) 

so that the cross-correlation can be described using a single variable: 
Rzy[k] = Ezy [z[n]y*[n + kl] . (3.259) 
Similarly, the cross-covariance function C,,[k] of two jointly WSS random processes x[n] and y[1] is 
Cry|k] = Ezy [x[njy*|[n + kl] —mzm,. (3.260) 


Now we compute the mean, cross-correlation, and autocorrelation of the output of (3.257). The 
mean of the output signal is given by 


E, [y[n]] = 3 A[GEz[x[n — €] (3.261) 
£=—0o 
=m: > hd (3.262) 
f=—oo 
= mzh(e!?"°), (3.263) 


The last step follows from the definition of the DTFT, discussed further in Section 3.1.3. Among other 
things, notice that if the input is zero mean, then it follows from (3.263) that the output is also zero 
mean. 


The cross-correlation between the input and the output of an LTI system can be computed as 


Ryy[k] = Ez [z[n]y*[n + k]] (3.264) 
= S- h* |€E, |z[n]x*[n + k — €]] (3.265) 
&=—00 
= Sh" [QRza{k - (3.266) 
&=—o00o 
=" [e]) Fas lel: (3.267) 


The cross-correlation can then be computed directly from the convolution between the impulse 


response and the autocorrelation of the system. 
The autocorrelation of y[n] can be calculated as 


Big le] = Ey lub ijy*|n + k]] (3.268) 
- St a hie In — Ejy*|[n + k]] (3.269) 
&=—0o 
= =m h[é|Ray[k + 4 (3.270) 
f=—0co 
= h[-k] * Reylk]. (3.271) 


Substituting (3.267) into (3.271), 
Ryy[k] = h|—k| * h* [k| * Rz»|k}. (3.272) 


If the LTI system is bounded input bounded output stable, then the quantity h[—k]*h*[k] is well 
defined (is not infinity, for example). Then from (3.263) and (3.272), the output process is WSS. 


Using these results, in Example 3.32 we reconsider the calculation in Example 3.31. 


Example 3.32 
Consider the same setup as in Example 3.31. Compute the mean and autocorrelation function. 
Answer: From (3.262), 


dan 
Ely(n]] =Elz{n]] S> ala (3.273) 
&=—00 
=) (3.274) 
since the process is zero mean. 
To compute the autocorrelation function from (3.272) we need 


h{—k] « h*[k] = he \h[é — (3.275) 
L=0 

1 ‘i ‘ 1 \ o 3 , e 1 ‘i ” 1 ‘ ~ ‘ i! "RF 

= —Zo|k +2] + Zo[k +1] + 5d[k] + Z6[k—1]—55[k—2]. (3.276) 


Substituting into (3.272) gives 


| 3 1 1 


Ryy|k] - —5 fer [k 7% 2] TS {Ree [hk + 1] y 5 Rez lh “ q Raxlk = 1] ~~ 9 


which is identical to (3.251). Similar simplifications lead to the final answer in (3.256). 


Example 3.33 
A random process v[n] is white noise if the values v[1;], v[n;] are uncorrelated for every n; and n,, 
that 1s, 


Cy|[ni, 23] = 0, Vn, #n;. (3.278) 


It is usually assumed that the mean value of white noise is zero. In that case, for a WSS white noise 
process 


Cy[ni, nj] = Ry[ni,nj] = Ry[k] = 0265[k], (3.279) 


with k =n; — n;. Suppose that a WSS white noise process v[m] is the input to an LTI system with 
impulse response /[n] = d[n] + 0.86 [” — 1]. Compute the autocorrelation of the output of the LTT 
system. Is the output an uncorrelated process? 


Answer: From (3.272), 


Ryy[k] = h[—k] * h * [k] * Rpa[k] (3.280) 
= (d[k] + 0.86[k + 1]) « (6[k] + 0.86[k — 1]) * o26[k] (3.281) 
= (0.86[k + 1] + 1.6d[k] + 0.86[k — 1]) * 026k] (3.282) 
= 02 (0.86[k + 1] + 1.64[k] + 0.85[k — 1]). (3.283) 


From the expression of the autocorrelation it is clear that filtered white noise samples become 
correlated. 


3.2.8 Gaussian Random Processes 


The most common model for additive noise is the AWGN (additive white Gaussian noise) 
distribution. This refers to an IID random process where each sample is chosen from a Gaussian 
distribution. The real Gaussian distribution is written in shorthand as N (m, 6”) where for random 
variable v 


fo(x) = e207, (3.284) 


We will also encounter the circularly symmetric complex Gaussian distribution, written in shorthand 
as Ni-(m, oo). In this case, the real and imaginary parts are independent with distribution N (Re[m], 


07/2) and N(In[m], 67/2). It may be written as 


fo(z) =—se =, (3.285) 


ABS 


where x 1s complex. The relevant moments for an IID Gaussian random process are 


Mm, =m (3.286) 
Crz[n] = 076[n]. (3.287) 
An important fact about Gaussian random variables is that linear combinations of Gaussian random 


variables are Gaussian [280]. Because of this, it follows that a filtered Gaussian random process 1s 
also a Gaussian random process. This fact is illustrated in Example 3.34. 


Example 3.34 
Let x and y be two independent (0, 1) random variables. Show that z= x + y is N (0, 2). 


Answer: The distribution of the sum of two independent random variables is given by the 
convolution of the distributions of the two random variables [252, Section 6.2]. Letz=x+y, 


fil2) = fo(2) * fy(2) (3.288) 
1 ae t2—(2—t)? 
— e 2 dt (3.289) 
y) 
“al J =—0O0 
1 wy eo" fe 
=e | ew (t-2/2) ag (3.290) 
27 _ 
— err e 72 dt (3.291) 
2/7 =O \/ 2m 
1 wer ia (| 2 90 
= Te (3.292) 
__! -SS (3.293) 


Qn x 2 


=. 


Thus, z 1s M(0, 2). This can be generalized to show that if x is N (mz, a? ) and y is N (my, a2) and 


q 


they are independent, thenz =x + y is N(m, + My, o ae, : ) 


3.2.9 Random Vectors and Multivariate Random Processes 


The concepts of a random variable and a random process can be extended to vector-valued random 
variables and vector-valued random processes. This is useful for MIMO communication. In the case 
of random vectors, each point w, in the sample space S is associated with a vector X © R™ or C™” 
instead of a scalar value. In the case of a random vector process, each point a; in the sample space S 
is associated with a sequence of vectors x[n, w;]. Hence, for a fixed value of n, sayy = fi, x 7, w| 
is a vector random variable. This section defines some quantities of interest for vector random 
processes that will be useful in subsequent chapters. 


The mean of a random vector process x[”] of length M is 


m,|n] = Exjnj[x[n]] (3.294) 


Ez, {n} [v1[n]] 
— | Eesinj [alr] (3.295) 
Ee ay(n) [ea [n] 
The covariance matrix of a random vector process is 
Cxx (11, M2] = Exiny),x{ng] [ (x[n3] —m,(n]) (x[ng] — m,([n2|) "| (3.296) 
= Extny),x{na) [X[1]x*[n2]] — m,[n1]m}[n9]. (3.297) 


A similar definition is possible for the correlation matrix R,,[7, 1]. 


WSS vector random processes have a constant mean m, and a covariance that is only a function of 
the time difference C,,[”] = C,,[n. — 1]. For ID WSS vector random processes, C,,,[”] = Co[n] 
where C is known simply as the covariance matrix. If the entries of the vector are also independent, 


then C is a diagonal matrix. If they are further identically distributed, then C = oI. We deal primarily 
with IID WSS vector random processes in this book. 


Covariance matrices have some interesting properties that have been exploited in statistical signal 
processing to develop fast algorithms [143]. They are conjugate symmetric with C* = C. They are 
also positive semidefinite, meaning that 


x*Cx > 0, for allx e C™. (3.298) 


From the spectral theorem, for all conjugate symmetric matrices, there exists a decomposition 


C = Q*AQ, (3.299) 


where Q is a unitary matrix and A is a diagonal matrix. The columns of Q contain the eigenvectors of 
C, and A contains the eigenvalues of C on its diagonal. The eigenvectors and eigenvalues can be 
obtained from the eigendecomposition of C. In many cases it is desirable that the eigenvalues be 
ordered from largest to smallest. This can be achieved using the singular value decomposition [3 16, 
354). 

Multivariate Gaussian distributions [182] are widely used in the analysis of MIMO communication 
systems, as discussed more in Chapter 6. The multivariate real Gaussian distribution is written as NV 
(m,, C,) for random variable v where 


l (x- my)? cy} (x—my) 


——F —€ ? (3.300) 
(2n)* |Cy|2 


fv(z) = 


The multivariate circularly symmetric complex Gaussian distribution is written as \-(my, Cy) 
where 


1 wore 

—(x—my,)*C. °(x—my,) 9 Of 
—— v)"Cy ) (3.301) 
rM1Cy| 


fu(a) = 


Noise ina MIMO system is modeled as an IID multivariate random process with A/-(my,, C,)- 

The results of filtering a WSS random process can be extended to multivariate random processes 
by convolving x[”] with a multivariate impulse response H[n] as y[n] = }'pH[[]]x[” — 0]. We 
provide an illustration for a special case in Example 3.35. The other results in Section 3.2.7 can be 
similarly extended with some care taken to preserve the proper order of vector and matrix operations. 


Example 3.35 
Suppose that x[”] is a WSS random process with mean m, and covariance C,[k]. Suppose that x[7] is 
filtered by H[n] = Hd[n] to produce y[n]. Determine the mean and covariance of y[7]. 

Answer: The output signal is 


y(n] =) H[4x{[n - 9 (3.302) 
= Hx[n. (3.303) 
The mean is 
E [y(n] = E [x(n] (3.304) 
E [Hx[n]] = HE [x|n] (3.305) 
= Hm,. (3.306) 


Given Rex|h] = C,,. [k] +m,m*; the correlation matrix of y[7] 1s 


Ryy[k] = Ely[nly*[n + Al] (3.307) 
= E [Hx[n]x*|[n + k]H"| (3.308) 
=H (kin, (3.309) 
thus 
Cyy|k] = HR,,,[k|H* — Hm,m>,H" (3.310) 
= H(C,,{k] + m,m;) H* — Hm,m>H* (3.311) 


=HIC.. |k|H’: (3.312) 


3.3 Signal Processing with Passband Signals 


Wireless communication systems use passband signals, where the energy of the signal is 
concentrated around a carrier frequency f,. In this section, we present the fundamentals of signal 


processing for passband signals. The key idea is to represent a passband signal through its complex 
baseband signal representation, whichis also called its complex envelope. In this way, it is possible 
to decompose a passband signal into two terms, one that is a baseband signal without dependence on 
f, and another a complex sinusoid, which is a function of f,. A main advantage of the complex 


baseband signal representation 1s that it permits dealing with passband signals in a way that does not 
require explicit mention of the carrier frequency f,. This is convenient because the carrier frequency 


is typically added in the RF hardware. As a result, complex baseband signal representations permit 
an important abstraction of critical analog portions of a wireless system. 


This section reviews important concepts associated with passband signals. First, we describe 
upconversion, the process of creating a passband signal from a complex baseband signal. Then we 
describe downconversion, the process of extracting a complex baseband signal from a passband 
signal. In a wireless communication system, upconversion 1s performed at the transmitter whereas 
downconversion is performed at the receiver. Then we develop a complex baseband equivalent 
representation of a system that filters passband signals, and a related concept called the pseudo- 
complex baseband equivalent channel. We conclude with a discrete-time equivalent system, in which 
the transmitted signal, LTI system, and received signal are all represented in discrete time. This final 
formulation is used extensively in subsequent chapters to build signal processing algorithms for 
wireless communication systems. 


3.3.1 Upconversion—Creating a Passband Signal 

A real continuous-time signal x,(7) 1s a passband signal around carrier frequency f, if its Fourier 
transform x(f) = 0 for Jf € [feflow-SeShign] where fioy > 0 and fron > 0. For convenience, in this book 
double-sided signals are considered where hich =Slow = B/2. The exposition can be generalized to 


remove this assumption. The bandwidth of the signal is B, where the bandwidth is computed 
according to the definitions in Section 3.1.4. For the mathematical derivation, the continuous-time 
signals are assumed to be ideally bandlimited. 


Complex envelope notation is used to represent passband signals in a convenient mathematical 
framework. To develop the complex baseband signal representation, the bandwidth must be 
sufficiently smaller than the carrier frequency: B < f,. This is known as the narrowband assumption. 


The narrowband assumption 1s not limiting as it applies to virtually every wireless communication 
system except for some ultra-wideband systems [281]. 

Now we provide a representation of a passband signal x,(t) and subsequently simplify it to obtain 
its baseband equivalent. Suppose that x,(7) 1s a narrowband passband signal with carrier /,. A 
passband signal can be written as 


Lp(t) = A(t) cos(27f,t + (t)), (3.313) 
where A(¢) is an amplitude function and (f) is a phase function. In AM (amplitude modulation), 


information is contained in the A(t), whereas in FM (frequency modulation), information is contained 
in the derivative of the phase term (¢). The formulation in (3.313) is not convenient from a signal 


processing perspective since the o(¢) occurs in conjunction with the carrier inside the cosine term. 
Another way to represent passband signals is to use inphase and quadrature notation. Recall the 
trigonometric identity cos(A+B) = cos(A) cos(B)-sin(A) sin(B). Simplifying (3.313) leads to 
Lp(t) = A(t) cos(d(t)) cos(27 ft) — A(t) sin(d(t)) sin(27 fot). (3.314) 
——$_ ee” eek a aS 


xj(t) Lq(t) 


The signals x,(¢) and x,(t) are called the inphase and quadrature components. They are both real by 
construction. Based on (3.314), the passband signal x,(¢) can be generated from inphase x((¢) and 
quadrature x,(¢) signals through multiplication by a carrier. This process is illustrated in Figure 3.14 


and is known as upconversion. The multipliers are implemented using mixers in hardware. An 
oscillator generates a cosine wave, and a delay element would be used to generate a sine wave. More 
elaborate architectures may use multiple stages of mixing and filtering [271]. 


Re{x(t) } 


cos(27fct) 


— sin(27f.t) 
Figure 3.14 Mathematical block diagram model for direct upconversion of a baseband signal to a 


passband signal. The real analog RF hardware may not perform a direct conversion but rather may 
have a series of upconversions. 


Using complex envelope notation, the inphase and quadrature signals are combined into a single 
complex signal. Let x(t) = x,t) + jx,(t) denote the complex baseband equivalent or complex envelope 
of x,(¢). Recall from Euler’s formula that e” = cos(0) + j sin(@). Multiplying the complex baseband 
equivalent by the complex sinusoid ec", then 


r(t)el?rFet — xi(t) cos(27 f.t) — rq(t) sin(27 f.t) 
+ j(xj(t) sin(27f,t) + rq(t) cos(27 fi t)). (3.315) 


The real part corresponds to the passband signal in (3.314). The result can be written more compactly 
as 


tot) = Re [x(te**"**| . (3.316) 


The expression in (3.316) shows how x,(t) 1s separated into a term that depends on the complex 
baseband equivalent and a term that depends on the carrier f,. 


Now we establish why x(¢) in (3.316) is necessarily bandlimited with bandwidth B/2. First rewrite 


(3.316) as 

eo) = oe + <a*(t)e7 inset (3.317) 
to remove the real operation. Using the Fourier transform properties from Table 3.1, 

xp(f) = ; (x(f — fe) + x"(-f — fe))- (3.318) 


Essentially, a scaled version of x(f) is shifted to f,, and its mirror image x *(—/) is shifted to —f,. The 
mirroring of the spectrum in (3.318) preserves the conjugate symmetry in the frequency domain, 


which is expected since x,(¢) is a real signal. The main implication of (3.318) is that ifx(¢) is a 
baseband signal with bandwidth B/2, then x,(¢) is a passband signal with bandwidth B. This process 
is illustrated in Figure 3.15. 

Baseband x(f) 


BI? Sf 


Lh Upconversion 


Passband itn) 


Te Te I 
Figure 3.15 Upconversion in the frequency domain, which takes a baseband signal to a passband 
signal. The scaling factor A is used to show how the amplitude scales in each case. 


Example 3.36 
Let x(t) be the complex envelope of a real-valued passband signal x,(7). Prove that x(Z) is generally a 
complex-valued signal and state the condition under which x(¢) is real valued. 


Answer: Because x,(t) 1s a passband signal, it can be written as 


Lp(t) = Re[x(t)e!?*Fe4] (3.319) 
= Re|x(t)(cos(27f.t) + jsin(27ft)| (3.320) 
= Re[x(t)] cos(27f,t) — Im|[x(t)]| sin(27 f,t). (3.321) 


By equating like terms in (3.321) and (3.314), we see that Re[x(1)] = x(¢) and In] x(t)] = x,(2). 
Therefore, x(¢) is generally complex valued. It is real valued when x,(t) = 0, that is, x,(t) does not 
have a quadrature component. 


3.3.2 Downconversion—Extracting a Complex Baseband Signal from a 
Passband Signal 


Downconversion is the reverse of upconversion. It 1s the process of taking a passband signal and 
extracting the complex baseband equivalent signal. In this section, we explain the process of 
downconversion. 


Suppose that y,,(7) is a passband signal. Because it is passband, there exists an equivalent baseband 
y(t) such that 


yp(t) = Re [y(t)e?™4«*] . (3.322) 


To understand how downconversion works, recall the following trigonometric identities: 


: 4% | 
sin(w) sin(v) = 71 [cos(u — v) — cos(u + v)] (3.323) 
| 
cos(u) cos(v) = 5 [cos(u — v) + cos(u + v)] (3.324) 
a | | _ 
sin(u) cos(v) = . [sin(u — v) + sin(u + v)}. (3.325) 
Using (3.323)-(3.325): 

1 1 1 a eee 
Yp(t) cos(27f,t) = 5 Milt) + 5 ¥ilt) cos(4z fet) — 5 alt) sin(4z7 ft) (3.326) 
ee 1 1 | 1 , ie 
Yp(t) sin(27 fet) = —F¥q(t) + 5Yq(t) cos(4m fet) + Syi(t) sin(4z fet). (3.327) 


The 2f, components of the received signal are significantly higher than the baseband components. To 


extract the baseband components, therefore, it suffices to apply a lowpass filter to the outputs of 
(3.326) and (3.327) and correct for the scaling factor. 


We can explain downconversion mathematically in a compact fashion. An ideal lowpass filter with 


unity gain and cutoff B/2 has a Fourier transform given by rect(//B) or Bsinc(tB) in the time domain 
from Table 3.2. The downconverted signal can be written as 


y(t) = 2Bsinc(tB) * (yp(tye aaa Zia (3.328) 


This expression captures the key ideas in a way that leads to a complex baseband signal. 


Figure 3.16 shows a block diagram for downconversion. Effectively, the downconversion 
operation shifts the spectrum of the passband signal through multiplication by sine and cosine, then 
filters out the remaining higher-frequency replicas and corrects for the scaling factor. As is the case 
for upconversion, there are many practical hardware architectures to implement downconversion 
[271]. 


Xp\ ft) 


SZ 


ZS : 
x» LPF 


4 


cos(27f.1) 
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—sin( 27/0) 

Figure 3.16 Mathematical block diagram for direct downconversion of a passband signal to a 
baseband signal. The real analog may not perform a direct conversion but rather may have a series 
of upconversions and downconversions. Additionally, there is typically a low-noise amplifier and 

a bandlimiting filter just after the antenna and a series of gain control circuits. 


It is useful to understand how downconversion works in the frequency domain. Taking the Fourier 
transform of (3.328), the downconversion process can be expressed mathematically as 


y(f) = 2 rect (3) Yp(f + fe). (3.329) 


Essentially, as illustrated in Figure 3.17, the entire spectrum of y(t) is shifted down by f,, and then a 
filter is applied to remove the high-frequency image. 


Passband yp(/) 


y Downconversion 
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ft 


B/2 
Figure 3.17 Downconversion in the frequency domain, which takes a passband signal to a 
baseband signal. The scaling factor A is used to show how the amplitude scales in each case. 


Equations (3.329) and (3.328) assume an ideally lowpass filter exactly matched to the bandwidth 
of the signal, but a wider nonideal lowpass filter could be substituted, leading to the same 
mathematical result. In practice, the bandwidth is typically taken to be near B for considerations 
related to noise and adjacent channel interference rejection. 


The connection between baseband and passband signals is important in viewing wireless 
communication systems through the lens of signal processing. Throughout this book, when dealing 
with wireless communication signals, only their complex baseband equivalents are considered. Issues 
related to imperfections in the upconversion and downconversion, due to differences between the 
carrier frequency at the transmitter and the receiver, are explored further in Chapter 5. 


3.3.3 Complex Baseband Equivalent Channel 


Thus far we have established a convenient representation for generating passband signals from 
complex baseband signals, and for extracting complex baseband signals from passband signals. 
Upconversion 1s used at the transmitter to create a passband signal, and downconversion 1s used at 
the receiver to extract a baseband signal from a passband signal. These are key operations performed 
by the analog front end in a wireless communication system. 


From a signal processing perspective, it is convenient to work exclusively with complex baseband 
equivalent signals. A good model for the wireless channel is an LTI system as argued in Section 3.1.2. 
This system, though, is applied to the transmitted passband signal x,(¢) to produce the received 


passband signal 
yp(t) = / he(t — 7)ap(r)dr (3.330) 


as illustrated in Figure 3.18. Because it is at passband, the input-output relationship in (3.330) is a 
function of the carrier frequency. Based on the results in Section 3.3.2, it would be convenient to have 
an equivalent baseband representation that is just a function of the complex envelopes of the input 


signal and the output signal for some baseband channel /(t). In other words, it would be nice to find a 
relationship 


nt) = / h(t — r)a(7)dr (3.331) 


f 
Figure 3.18 Convolution between the passband signal and the channel. In this case the total 


response of the channel h,(/) is illustrated. 


for some suitable A(t). The complex baseband equivalent channel A(t) thus acts on the input signal x(7) 
to produce y(f), all at baseband. 

In this section, we summarize the steps in the derivation of the complex baseband equivalent 
channel h(t) from the real channel ,(t) that is not bandlimited. The idea illustrated in Figure 3.18 is 
to recognize that only a portion of the channel response impacts the passband signal. Then an 
equivalent passband channel response can be considered as illustrated in Figure 3.19, without any 
loss. Finally, the passband channel is converted to its baseband equivalent in Figure 3.20. 


lyp(f)| 


f 
Figure 3.19 Convolution between the passband signal and the channel 


f 
Figure 3.20 Convolution between the baseband signal and the baseband equivalent channel 


Now we provide the mathematical steps for deriving the complex baseband equivalent channel. We 
work with the frequency domain, following Figure 3.20. The objective is to find the baseband 
equivalent channel h(f) such that y(/) = h(/)x(/). Attention is needed to the scaling factors in the 
derivation. 


Consider the received signal 
Yp(f) = he(f)xp(f). (3.332) 
Because x,(f) is a passband signal, only the part of h,(f) that is within the passband of x,(f) is 
important. Let us denote this portion of the channel as h,(/). 


The passband channel h,(f) is derived by filtering h(f) with an ideal passband filter. Assuming that 
x(f) has a passband bandwidth of B, the corresponding ideal passband filter with bandwidth of B 
centered around /, is 


LB cacao . (FFD — 

p( f) = rect ( B ) + rect ( B ) (3.333) 
— cenit ven Leh RR — 
= rect ( B ) + rect ( B ) (3.334) 


or equivalently in the time domain is 


p(t) = 2B cos(27 f,t)sinc( Bt). (3.335) 


Since x,(f) = x,(f)p(f), it follows that 


Yp(f) = he(F)P(f)xp(F) (3.336) 
= hp(F)%p(f) (3.337) 

where 
hp(f) = p(f)he(F) (3.338) 


is the passband filtered channel. 


Now suppose that h(/) is the complex baseband equivalent channel so that y(/) = h(/) x(f). The 
passband signal obtained using (3.318) is then 


if 1 
y(f) = sh(f — fe) xf — fe) + sh'(—-F — fe)x"(-f — fe). (3.339) 
But it is also true that 
yp(f) = hp(f)xp(f). (3.340) 


Let h,(f) be the complex baseband equivalent of h,(/). Substituting for h,(f) and x,(/) again using 
(3.318), 


1 Fs et kg boo & a ae Lae 
yp(f) = (Sho(t at fe} + 5h (—f a f.)) (; x(f =< fom + 9 x (—f a f.)) (3.341) 
1 
= qhe(s fe) (Ff = fe)+ ahe'(-f Sg ee (3.342) 


Equating terms in (3.342) and (3.339), it follows that 


1 , ere 

h(f) = shp(f). (3.343) 
The factor of 1/2 arises because the passband channel was obtained by passband filtering a non- 
bandlimited signal, then downconverting. With similar calculations as in (3.329), 


ea — : 2 rect (4) ho( f+ fe) (3.344) 
= rect (4) rect (4) he(f + fe) + rect (a4) he(f — f) (3.345) 
= rect (Z) he( f + fe). (3.346) 


Compared with (3.329), the factor of 1/2 cancels the factor of 2 in (3.346). The entire process of 
generating the complex baseband equivalent channel is illustrated in Figure 3.21. 


Ihp(/)| 


f 
Figure 3.21 Frequency-domain illustration of the transformation from h,(f) to h,(/) to hy(/) to h(/). 
An arbitrary scaling is selected in h,(f) just to illustrate subsequent rescalings. 


The time-domain response follows directly from the Fourier transform properties in Table 3.1: 
h(t) = Bsinc( Bt) * h,(t)e?*F«t (3.347) 
= B [ sinc a(t — T))he(r)eb?*Fe7 dr. (3.348) 


Essentially, the complex baseband equivalent channel is a demodulated and filtered version of the 
continuous-time channel /,(t). The time-domain operations are illustrated in Figure 3.22. Several 


example calculations are provided in Examples 3.37 and 3.38. 


— sin(27 f,t) 


Figure 3.22 Creating the complex baseband equivalent by demodulating the channel 


Example 3.37 


Suppose that the signal has to propagate over a distance of 100m to arrive at the receiver and 
undergoes an attenuation of 0.1. Find a linear time-invariant model for this channel /,(¢), then find the 


passband channel /,(t) and the baseband equivalent channel /(t). The baseband signal bandwidth is 
5MHz and the carrier frequency is f, = 2GHz. 


Answer: Consider the speed of propagation to be c = 3 x 108m/s. Then the delay between the 
transmitter and the receiver is ty = 100/c = 1/3us = 1/3 x 106s. Consequently, the channel can be 
modeled as 


h,(t) = 0.16(t — 1/3 x 107°). (3.349) 


The baseband signal bandwidth is 5MHz; thus the passband bandwidth is B = 1OMHz. Now 


hy(t) = h_(t) * p(t) (3.350) 
= [ — T)h.(r)dr (3.351) 
=}, 1 | p(t — r)d(t — 1/3 x 10~®)dr (3.352) 
= 0.1 p(t — 1/3 x 107°) (3.353) 


= 2x 0.1 x 10’ cos(272 x 10°(t — 1/3 x 10~®))sine(10“(t — 1/3 x 10~®)). (3.354) 


Similarly, to find the baseband channel, we use (3.348): 
h(t) = 0.1 x 10° / sinc(10"(t — T))6(r — 1/3 x 1078) e327 210" G7 (3.355) 
= 0.1 x 10’sinc(107(t — 1/3 x 107®))e7527 5% 10", (3.356) 
Simplifying further given the numbers in the problem, 
h(t) = 10°sinc(107 (t — 1/3 x 10~°))e7i73. (3.357) 


Example 3.38 
Consider a wireless communication system with the carrier frequency of f, = 900MHz and the 
absolute bandwidth of B = 5MHz. The two-path channel model is 


h(t) = 6(t — 2 x 107°) + 0.55(t — 5 x 107°). (3.358) 
¢ Determine the Fourier transform of /,(2). 


Answer: Let ¢; = 2 x 10° and t, = 5 x 10 © and rewrite the channel response in the time 
domain as 


h(t) = 6(¢ — t,) + 0.56(¢ — ta). (3.359) 
In the frequency domain the channel response is 
h(f) = e Jr sett 4 0. 5eI2t feta (3.360) 
and the magnitude (suitable for plotting) is 
Ih(f)| = (1.25 + cos(2af (ti — t2)))/”? (3.361) 
= (1.25 + cos(6 x 10-8 f))/?. (3.362) 


* Find the passband channel /,(t) in both time and frequency domains. 


Answer: In the time domain 


hy(t) = he(t) * p(t) (3.363) 
= p(t — #1) + 0.5 p(t — ta). (3.364) 
In the frequency domain 
hp(f) = he(f)p(F) (3.365) 
: eo for f € [—f. — B/2,—f. + B/2] or [fe - B/2, fe + B/2 
0, elsewhere 


(3.366) 
= (e~J2r feta + 0.5¢e7J27 Jet2 ) rect ( 5) + rect @ = fe) . (3.367) 


The magnitude of the channel is 


Ihy(f)| = [1.25 + cos(6 x 10~°rf)] iA rect z = fe + rect f 5 Te 
B B 
(3.368) 


¢ Find the complex baseband equivalent channel h(t) in both time and frequency domains. 


Answer: Applying (3.348), in the time domain 


h(t) = B | sine(B(t — 1))[6(t — th) + 0.5 6(¢ ~ tale" dr (3.369) 
= B [sinc(B(t — t,))e 77 Ft 4 0.5 sinc(B(t — to) )e I?m Feta | (3.370) 
= B|sinc(B(t — t1)) + 0.5 sine(B(t — t2))) (3.371) 


In the frequency domain 


h(f) = [een Fots + Oise Fr seFa) rect =. ‘ (3.372) 
The magnitude of the frequency response is 
Ih(f)| = + [1.25 -+ cos(6 x 10-F f))”? rect (—F__ (3.373) 
‘ 1 10 x 106 }” 


3.3.4 Pseudo-baseband Equivalent Channel 


For the special case when the channel is a sum of delayed impulses, there is an alternative to the 
complex baseband equivalent channel. We refer to this as the pseudo-baseband equivalent channel; 
this terminology is not standard in the literature. 


To derive the pseudo-baseband equivalent channel, consider the following channel with R rays or 
paths, each with amplitude a,. and delay 1,.: 


R-1 
h(t) = )— a,6(t — 7;). (3.374) 


ro 
Generalizing the results on the two-path channel models, it follows that the complex baseband 
equivalent is 
R-1 
Rit} = or a,e IFT Bsinc(B(t — 7,)). (3.375) 
r=0 
Note that this can be rewritten by factoring out the sinc function as 
R-1 
h(t) = Bsinc( Bt) x 5. aye 12" FeTr §(¢ — 7,). (3.376) 
r=0 
We refer to the quantity hip ft) = ) als ape I? feTr §(¢ — 7,,) as the pseudo-baseband equivalent 
»b r—0 “tr® T 


channel. This channel has complex coefficients o,.e ct but otherwise is strikingly similar to h(t). 
Furthermore, since the signal is bandlimited, it follows that y(¢) = A(Z) * x(t) = hy (0) * x(2), So App (2) 


can still be convolved with x(t) to obtain the complex baseband equivalent signal y(t). Unfortunately, 
A p(t) itself is not bandlimited and cannot be sampled; therefore, caution should be used when 


deriving the discrete-time model as described next. 


Example 3.39 
Find the pseudo-baseband equivalent channel for the case described in Example 3.37. 


Answer: Since /,(¢) = 0.1 8(t— 1/3 x 10), it follows that 


Rpp(t) = 0.1e7J27 5 * LO" S(t —1/3x 1or*) (3.377) 
= 0.1e "8 5(t — 1/3 x 10~%). (3.378) 


Example 3.40 
Find the pseudo-baseband equivalent channel for the case described in Example 3.38. 


Answer: Since /,(t) = o(¢ — 2 x 10°) + 0.5 8(t — 5 x 10°), it follows that 
hpp(t) ~ e—J27900x 10° x2x 10 6(t _9x 10-®) 
At 0.5e7427900 x 10" X5x107" §(¢ = 5 x 107") (3.379) 


= 6(t —2 x 10-°) +:0.56(¢ — 5.x 107°). (3.380) 


In this case the pseudo-baseband equivalent channel is the same as the original channel since the 
complex phase effects vanish because of the choice of numbers in the problem. 


3.3.5 The Discrete-Time Equivalent Channel 


The bandlimited property of complex baseband equivalent signals allows us to develop a system 
model that depends entirely on the sampled complex baseband equivalent signals and a discrete-time 
equivalent channel. From this representation it will be clear how digital signal processing can be 
used to generate the transmitted signal and to process the received signal. 


Suppose that x(t) and y(t) are bandlimited complex baseband signals with bandwidth B/2 and that 
the sampling period 7’ > 1/B satisfies Nyquist. The complex baseband signal x(¢) to be transmitted can 
be generated from its samples {x[n]} using the reconstruction formula in (3.82) as 

= 
Te me x[n|sine((t — nT) /T)). (3.381) 
In practice, x(t) would usually be generated using a pair of digital-to-analog converters (one for the 
inphase component and one for the quadrature component). The complex baseband signal y(f) at the 
receiver can be periodically sampled without loss as long as Nyquist is satisfied to generate 


y[n] = y(nT), (3.382) 


which would be performed with a pair of analog-to-digital converters (one for the inphase component 
and one for the quadrature component) in practice. 


Applying the results from Section 3.1.6, the input-output relationship in (3.331) can be written 
equivalently in discrete time as 


yln] = S_ hfkjx[n — k] (3.383) 


k=-—00 


where h[n] = Th(nT) since h(t) is already bandlimited by construction in (3.346). 


The sampled channel A[n] is known as the discrete-time complex baseband equivalent channel. It is 
obtained from the continuous-time channel by combining (3.114) and (3.348) as 


h{n] = TB / sinc(B(nT — T))he(r)e- 774" dr. (3.384) 


In the special case of critical sampling where 7 = 1/B (often used for examples and homework 
problems), the scaling factors disappear and 


h{n] = | sinc(n — Br)he(r)e7!?7F7 dr. (3.385) 


The importance of discrete-time signal processing to digital communication systems results from 
the use of passband communication signals. Exploiting the passband property to develop complex 
baseband equivalents of the input and output signals and the resulting bandlimited property of those 
signals allows the development of a discrete-time signal model. The steps of abstraction are 
summarized in Figure 3.23. Throughout this book we will continue refining this model, adding 
additional impairments like additive noise and eliminating assumptions like perfect downconversion 
as we progress. 


DOWNCONVERSION — 


—>| ne) y(t) 
vy 
x{n| y|n] 


Figure 3.23 Refinement of the digital communication system 


Example 3.41 
Find the discrete-time complex baseband channel /[n] obtained from h(t) for Example 3.37 with 
sampling exactly at the Nyquist rate. 

Answer: The baseband signal bandwidth is 5MHz; thus the passband bandwidth is B = 10MHz. 
The Nyquist frequency of the baseband signal is 5MHz; thus the Nyquist rate is 1OMHz. Sampling at 


the Nyquist rate, T= 1/10’s. Since h(t) is already bandlimited, the Nyquist sampling theorem can be 
applied directly: 


h|n| = T h(nT) (3.386) 
= Fp7 0-1 x 10’sine(107(n/107 — 1/3 x 107) )e— i275 x10" (3.387) 
= 0.1 sinc(n — 10/3)e7i*8. (3.388) 


The relatively simple channel that delays and attenuates the transmitted signal results in an impulse 
response that attenuates, phase shifts, and convolves the received signal with a sampled shifted sinc 
function. In special cases, further simplifications are possible as shown in the next example. 


Example 3.42 


Find the discrete-time complex baseband channel h[n] obtained from h(t) for Example 3.38 with 
sampling exactly at the Nyquist rate. 


Answer: From Example 3.38, B =5 x 10°H7z; therefore, sampling at the Nyquist rate, 7= 1/B = 0.2 
x 10s. The discrete-time complex baseband channel is 


h[n|] = T h(nT). (3.389) 
Substituting for A(t) from (3.371), 
hn] = TB sinc(n — Bt) + 0.5 sine(n — Bt2) (3.390) 
where ¢, = 2 x 10° °s and t, = 5 x 10°°s. Substituting for all variables, 
h|n] = sinc(n — 10) + 0.5 sine(n — 25) (3.391) 
= §[n — 10] + 0.56[n — 25]. (3.392) 


The final simplification occurs because sinc(n) = 6[n]. The delays in this problem were specially 
selected relative to the bandwidth to create this final simplification. 


3.4 Multirate Signal Processing 


As the book progresses, there will be several opportunities to employ digital signal processing 
techniques at either the transmitter or the receiver to simplify implementation and improve overall 
radio flexibility. To that end, multirate signal processing will be useful. Multirate signal processing is 
a class of DSP techniques that facilitate changing the sampling rate after sampling has occurred. In 
this section, we review some important concepts from multirate signal processing, including 
downsampling, upsampling, polyphase sequences, filtering, and interpolation or resampling. 


3.4.1 Downsampling 


Downsampling, also called decimation, is a means for reducing the sample rate of a discrete-time 
sequence. Let M be a positive integer. Downsampling a signal by / involves creating a new signal 
where every M/ —1 samples in the original sequence are thrown away. The block diagram notation for 
downsampling is |M. Its application and a graphical example are provided in Figure 3.24. 


0 | 2 0 | 
Figure 3.24 Downsampling of a discrete-time signal. The top figure illustrates the downsampling 
notation for use in the block diagram, and the bottom figures illustrate the downsampling operation. 


To explain the operation of the downsampler, let z[”] be the input and y[7] the output. 
Mathematically the input and output are related as 


y(n] = z[nM]. (3.393) 
Samples in z[n] are discarded to produce y[n]; therefore, information is lost. The reduction in 


information creates an aliasing effect in the frequency domain: 
M-1 


ry 39 . l sar ie) 1 Dare fj ’ 
y(e?"F) = a y x (<4) M—j27m, a (3.394) 


m=0 


Because of aliasing, it is not possible in general to recover z(e”™) from y(e?). Filtering the signal 
z[n] prior to downsampling can eliminate aliasing in (3.394). 


Example 3.43 
Consider the samples of the function x(t) = sin(27 ft) + sin(27fot + %) att = nz for f| = 


18KHz, f, = 6KHz, and a sampling frequency f, = 48KHz. Some of these samples are shown in Figure 
3.25(a) forn=1,..., 20. The magnitude spectrum of the resulting sequence can be seen in Figure 
3.25(b), with the frequency axis in normalized units. The signal is decimated by M = 2, and part of the 
resulting sequence is shown in Figure 3.25(c). The spectrum of the decimated version of the signal is 
shown in Figure 3.25(d). Explain why the magnitude spectrum of the decimated signal consists of 
only two impulses. 
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(c) (d) 
Figure 3.25 (a) Original sequence obtained by sampling the two sinusoids in Example 3.43; (b) 
magnitude spectrum of the sequence in (a); (c) sequence in (a) decimated by 2; (d) magnitude 
spectrum of the decimated signal 

Answer: The spectrum of the original sequence consists of impulses at +F, and +F,, which 
correspond to the normalized frequencies F, = A = jandF2 = a = 3. After decimating the 
signal, the new sampling frequency is f’ = 24] Hz, and only two impulses can be observed at 
Q= 1. 
= 

This result can be explained by building the spectrum of the decimated signal from (3.394). When 
decimating a signal by M = 2, every component at frequency Q; appears at 2Q;. The impulses in the 
original spectrum atQ = +4 appear now atQ) = + A. The impulses at(Q) = +3 go to 


Q=- +$ > |1|and can be observed also at +(2 — S) = + >" Thus, because of the aliasing effect, a 


single impulse resulting from the sum of both components is observed in the spectrum of the 
decimated signal. 


3.4.2 Upsampling 

Upsampling is a transformation of a discrete-time sequence that involves inserting zeros after every 
sample. Let L be a positive integer. Upsampling a signal by Z means that a new signal is produced 
where each sample of the original signal 1s followed by Z — 1 zeros. In block diagrams, the 
upsampler is written as TL. Its application and an example are illustrated in Figure 3.26. 


s[n] —+(t L)—+ s[n] 


s{0] s[0] 


L—1 zeros L— 1 zeros 
Figure 3.26 Upsampling of a discrete-time signal. The top figure illustrates the upsampling 
notation for use in a block diagram, and the bottom figures illustrate the effect of upsampling— 
inserting zeros in the signal. 


Let s[n] be the input to an upsampler, and let 52 | be the output. When there is no possibility of 
confusion, for simplicity we drop the subscript L. Mathematically, the input-output relationship of the 
upsampler can be written as 


s[n] = s{k]é|[n — kL). (3.395) 
l J J 
k 


Using Fourier theorems, the input and output signals are related in the frequency domain as 
5(e)?7F) = s(e)?7 FF). (3.396) 


In the frequency domain, the upsampling operation has the effect of compressing the spectrum. There 
is no aliasing because samples are not lost in the upsampling process. 


Example 3.44 


Consider the system in Figure 3.27, including downsampling and upsampling operations. Compute the 
expression of y[”] in terms of x[7]. 


15) be) 


Figure 3.27 Block diagram of a system including a combination of downsampling and upsampling 
operations 


Answer: Using the fact that 5 and 3 are coprime numbers, the upsampler and the decimator 
commute, so we can switch the order of the first two blocks. Then, note that the 5-fold expander 
followed by the 5-fold decimator is equivalent to an identity system. Thus, the system is equivalent to 
a 5-fold decimator followed by a 5-fold upsampler, and consequently 


a[n], n=5k, k=0,1,2,... 
y|n| = Py ° | (3.397) 


0, otherwise. 


3.4.3 Polyphase Decomposition 


Multirate signal processing often involves decomposing a sequence into its polyphase components 
and processing each component separately. Let M be a positive integer. The signal s,,[n] = s[nM + m] 


denotes the m'" polyphase subsequence of s[7]. 


A sequence can be decomposed based on its polyphase components. Writing s[7] using the 
Kronecker delta function, 


s(n] = S— s[k]6[n — kl. (3.398) 
k=—oco 
Rewriting the sum over Mr + m instead of k, 
M-1 oo 
s|n| = >. s|[Mr + m]é{n — Mr — m] (3.399) 
m=0 r=—0co 
M-1 ox 
= S- Sm|r|6[n — Mr — ml]. (3.400) 
m=0 r=—co 


There is no information lost when decomposing a signal into its polyphase components. The operation 
of decomposing s[7] into its polyphase components is illustrated in Figure 3.28. 


Figure 3.28 Decomposing a signal into its polyphase components. The notation z*! is used to refer 
to an operation that advances the signal by one sample. 


Rewriting (3.400) gives us a clue about how to reconstruct a sequence from its polyphase 
components using the upsampling operation. Recognizing the connection between (3.400) and 
(3.395), (3.400) can be rewritten as 


M-1 


s[n] = >. Sm|[n — m] (3.401) 


m=0 


where Sm? is the output of upsampling by M. The reconstruction of a signal from its polyphase 
components is illustrated in Figure 3.29. 


Figure 3.29 Reconstructing a signal into its polyphase components. The notation z ! is used to 
refer to an operation that delays the signal by one sample. 


Example 3.45 


Consider the sequence s[n] = 6[n]+0.56[n—1]+0.756[n—2]—0.256[n—3] and find its polyphase 
components when M = 2. 


Answer: From (3.400) the polyphase components of a signal are s,,[n] =s[nM + m],0<m<M— 
1. In this simple example there are two polyphase components: 


so[n] = s[2n] = d[n] + 0.756[n — 1], (3.402) 
s[n] = s[2n + 1] = 0.56[n] — 0.256[n — 1]. (3.403) 


3.4.4 Filtering with Upsampling and Downsampling 


Multirate signal processing often occurs in conjunction with filtering. Multirate identities are used to 
exchange the order of operations between upsampling or downsampling and the filtering 
(convolution) operations. In software-defined radio, smart applications of multirate identities can 
reduce complexity [273]. 


The downsampling filtering identities are illustrated in Figure 3.30. The downsampling 
equivalence shows how to exchange filtering after downsampling with filtering (with a new filter) 
before the downsampling operation. In the time domain, the relationship 1s 


y(n| = z[nM] * gin] (3.404) 
— pa z|kM]g|n — k] (3.405) 

k 
- » z|k|g|nM — k). (3.406) 


: 
An] —~(6i)—+| etm) | y{n] 


Figure 3.30 Downsampling equivalence. Filtering a downsampled signal can be equivalently 
performed by downsampling a signal filtered with an upsampled filter. 


By upsampling the filter g[n] by M to create 9 in} the convolution can be performed prior to the 
downsampling operation. 


The upsampling filtering identities are illustrated in Figure 3.31. The upsampling equivalence 
shows how to exchange filtering before the upsampler with filtering after the upsampler. In the time 
domain, the equivalence is 


x s|mlg|k — nt) d{n — kL] (3.407) 


m 


— 2: (>: s|k]é[m — x) (> g|plo[n — m — ru) (3.408) 


k Pp 


— pa s|m|gin — m). (3.409) 


s[n] ® x[n] 
s[n] —+(F1)}—+| atm) -— x[n] 


Figure 3.31 Upsampling equivalence. Filtering a signal and then upsampling is equivalent to 
upsampling the signal and then filtering with an upsampled filter. 


Essentially the upsampling equivalence states that a filter signal can be upsampled or the upsampled 
signal can be convolved with the upsampled filter. 


3.4.5 Changing the Sampling Rate 


The sampling rate at the transmitter is often determined by the rate of the digital-to-analog converter 
whereas the sampling rate of the receiver is determined by the rate of the analog-to-digital converter. 
It is useful when correcting for more sophisticated receiver impairments to increase the sampling rate 
to mimic the case where oversampling (sampling with a higher-rate analog-to-digital converter) is 
available. It may also be valuable in prototyping to design a system that uses very-high-rate 
converters (much more than the minimum needed to satisfy Nyquist), but the subsequent processing is 
performed at a lower rate to reduce complexity. 


Increasing the Sampling Rate by an Integer 


Consider a bandlimited signal x(t) that is sampled to produce x[n] =x(nT) and the sampling period T 
is sufficient for the Nyquist sampling theorem to be satisfied. Suppose that we would like z[n] = 
x(n1/L) where L is a positive integer. Since x(f) is bandlimited, 


z= S > a[m| sinc((t — mT) /T), (3.410) 
therefore, 


x(nT/L) = )~ x{m] sine((nT/L — mT)/T) (3.411) 


m 


= S— x[m sinc((n — mL)/L). (3.412) 


mr 


Increasing the sampling rate requires upsampling by L, then filtering with an ideal discrete-time 
lowpass filter with gain Z and cutoff 1/Z as illustrated in Figure 3.32. 


LPF | 
x[n| —-(2)}-—+ Gain L z|n| 
Cutoff 1/L 


Figure 3.32 Increasing the sampling rate by an integer factor L, also known as interpolation 


Decreasing the Sampling Rate by an Integer 


Decreasing the sampling rate is slightly different from increasing the rate because in this case 
information is lost. Assume that we have a bandlimited signal x(t) sampled at sampling period T such 
that the Nyquist sampling theorem is satisfied. Suppose that a signal with sampling period 7 is 
required where Mis a positive integer. If 7M does not satisfy the Nyquist sampling theorem, then 
aliasing will destroy the integrity of the original signal. The typical solution is to lowpass filter the 
discrete-time signal prior to downsampling as illustrated in Figure 3.33. Mathematically this leads to 


x(n] = M S > x[m| sinc((n — m)/M) (3.413) 


mm 


LPF 
x[n] Gain I x[n| 
Cutoff 1/M 


Figure 3.33 Decreasing the sampling rate by an integer factor M, also known as decimation 


where the factor of M is required to maintain the appropriate scaling factor in the signal. Lowpass 
filtering before downsampling is equivalent to lowpass filtering the continuous-time signal 
corresponding to bandwidth 1/2MT prior to sampling. 


Changing the Sampling Rate by a Rational Factor 


Given a signal sampled at rate 1/7, suppose that a signal sampled at rate L/TM is desired where M 
and L are positive integers. It is possible to solve this problem by first increasing the sampling rate to 
L/T, then downsampling by / to produce the desired rate. Concatenating Figure 3.32 and Figure 3.33 
produces the combined system in Figure 3.34. Let R = max(L, M). Mathematically, the new signal can 
be written as 


ca >> x[m| sinc((nM — m)/R). (3.414) 


m 


rin] = 


Gain L 


Cutoff 
min(1/L, 1/M) 


Figure 3.34 Changing the sampling rate by a factor of L/M, also known as resampling 


If the sampling period is greater than that of the original signal, then effectively the signal is lowpass 
filtered to avoid aliasing. Changing the sampling rate is useful in communication systems to correct 
for mismatch between the available hardware and the signal being processed. (For example, in 
software-defined radio the analog-to-digital converter may operate at a very high sampling rate, 
which is unnecessary to process a low-bandwidth signal.) 


3.5 Linear Estimation 


In subsequent chapters it will be useful to find estimates of unknown parameters. In this section, we 
review some background on linear algebra, then introduce three important estimators from statistical 
signal processing: linear least squares estimators, maximum likelihood estimators, and linear 
minimum mean squared error estimators. Under some assumptions, linear least squares is also the 
maximum likelihood estimator. We make use of linear least squares extensively in Chapter 5. Many of 
the results can be extended to linear minimum mean squared error estimation. 


3.5.1 Linear Algebra 

In this book, we denote matrices with bold capital letters like A and vectors (always column) with 
lowercase bold letters like b. We refer to the k"" row and €™ column of A as [A] ke) = x,e- We refer 
to a, as the C column of A. The k"" entry of b is b,. The identity matrix I is square with 1 on its main 
diagonal and zero elsewhere. The vector e, refers to the k“" column of I; thus it is a vector witha 1 in 


the k* row and zeros otherwise. We use 0 to denote a zero vector or matrix. The notation C’ is used 
to denote the space of N-dimensional complex vectors and C'*" to denote the space of N x M 
complex matrices. We similarly define RY and R‘ *™ for real vectors and matrices. 

Let A denote an N <x M matrix. If N= M, we say that the matrix is square. If V > M, we call the 
matrix tall, and if M > N, we say that the matrix is fat. We use the notation A! to denote the transpose 


of a matrix, A* to denote the Hermitian or conjugate transpose, and A‘ to denote the conjugate of the 


a ae 7. 

; eae = |2 
entries of A. Let b be an N x 1 vector. The 2-norm of the vector is given by Ib] V din=1 |nl 
Other kinds of norms are possible, but we do not consider them in this book. 

The inner product between two vectors a and b is given by (a,b) =a*a. A useful result is the 
Cauchy-Schwarz inequality: ('a,b)| < a |{al||[b|| where equality holds when b = Ba for some scalar B 
(essentially when a is parallel to b). The Cauchy-Schwarz inequality applies to functions in Hilbert 
space as well. 

= 
Consider a collection of vectors {Xn tn=1, We say that they are linearly independent if there does 


Z : K aN wy NT 
not exist a set of nonzero weights {Qn }n=1 such that >, dX, = 0. InC’ and R, at most N vectors 


can be linearly independent. 

We say that a square matrix A is invertible if the columns of A (or equivalently the rows) are 
linearly independent. If A is invertible, then a matrix inverse A“! exists and satisfies the properties 
AA !=A7A=L IFA is tall, we say that A is full rank if the columns of A are linearly independent. 
Similarly, if A is a fat matrix, we say that it 1s full rank if the rows are linearly independent. 

Consider a system of linear equations written in matrix form 


Ax=b (3.415) 


where A is the known matrix of coefficients, sometimes called the data, x is a vector of unknowns, 
and b is a known vector, often called the observation vector. We suppose that A is full rank. If V= M, 
then the solution to (3.415) is x = A 'b. This solution is unique and can be computed efficiently using 
Gaussian elimination and its variations [131]. If A is square and low rank or fat, then an infinite 
number of exact solutions exist. It is common to take the solution that has the minimum norm. If A is 
tall and full rank, then an exact solution does not exist. This leads us to pursue an approximate 
solution. 


3.5.2 Least Squares Solution to a System of Linear Equations 


Consider again a system of linear equations as in (3.415). Now suppose that V > M and that A is still 
full rank. In this case A is tall and the system is overdetermined in general. This means there are V 
equations but / unknowns; thus it is unlikely (except in special cases) that an exact solution exists. In 
this case we pursue an approximate solution known as linear least squares. Instead of solving (3.415) 
directly, we propose to find the solution that minimizes the squared error 


\| Ax — b]]?. (3.416) 


There are other related least squares problems, including nonlinear least squares, weighted least 
squares, and total least squares, among others. We focus on linear least squares in this book and drop 
the term /inear except where there is the possibility of confusion. 


One direct approach to solving (3.416) is to expand the terms, differentiate, and solve. This is 
cumbersome and does not exploit the elegance of the linear system equations. Alternatively, we 


exploit the following facts (see, for example, [143, 2.3.10] or the classic reference [51]). Let f(x, x°) 
be a real-valued function of complex vector x and possibly its conjugate x°. Then x and x° may be 
treated as independent variables, and the direction of the maximum rate of change is given by 

d \ a iae oe D/Ape c\|T : : . : 
dx f(x) = [0/0x4f(x),.--, O/OxN f(x, x°)| Furthermore, the stationary points of this function 
can be found by setting this vector derivative to zero. Many tables for vector derivatives exist [32, 


d ,+* d + it ath ae = * 
220]. We find the following useful: dx * * = O, yeXA=A and dxe* A‘ AX = A AX 
Now we can solve the problem in (3.416): 
d 9 a « d | d oe ) : 
|| Ax — bl|* = x" A* Ax — —x"*A*b — b* Ax + b*b (3.417) 
dx® ax dx axe dx® | 


A* Ax= A*b. (3.418) 


Setting (3.418) equal to zero gives the orthogonality condition 
A*(Ax —b) =0 (3.419) 
and then the normal equations 
a AX => A’B: (3.420) 
Note that A*A is a square and invertible matrix (invertible because A is full rank); thus it is possible 
to solve (3.420) for 
xis = (A*A)~'A*b (3.421) 
where the subscript LS denotes that this is the least squares solution to this set of linear equations. 


We use the squared error achieved by x; , to measure the quality of the solution. To derive an 
expression for the squared error, note that 


J(xis) = || Axis — bl]? (3.422) 
= xjsA*(Axzs — b) — b*(Axzs — b) (3.423) 
= b*b — b’* Axrs (3.424) 
where the first term in (3.423) is zero, which follows from the orthogonality property in (3.419). 
Substituting for x; 5, 
J(xis) = b*b — b* A(A*A)~1A*b (3.425) 
= b*(I — A(A*A) *A*)b. (3.426) 


The special matrix A(A*A) !A* is known as a projection matrix for reasons that become clear when 
viewing least squares froma graphical perspective [172]. 


Now we present a classic visualization of least squares. Suppose that NV = 3, M = 2, and x, A, and 
bare all real. The relevant quantities are illustrated in Figure 3.35. The observation vector b is 
unconstrained in IR°. The objective is to find the best linear combination of the columns of A, in the 
sense that the squared error is minimized. The vector Ax is a linear combination of the columns of A, 
which lives in the subspace denoted as span(A), in this case a plane. An exact solution to the set of 
equations 1s possible only when b can be exactly represented by a linear combination of the columns 
of A, in this case being in the same plane defined by the span(A). The least squares solution gives x; 5 


such that Ax; g is the orthogonal projection of b onto span(A). That projection is given by A(A*A) 
~!A*b where the matrix A(A*A) !A* is a projection matrix. The error vector is b— Axy. It is 
orthogonal to Ax; 5 according to the orthogonality condition in (3.419), and has the smallest 2-norm 
among all error vectors. The error can be written as (I— A(A*A) !A*)b where (I— A(A*A)!A*) is 


a matrix that projects b onto the orthogonal complement of span(A). The length of the error is given in 
(3.426). 


*. Axz¢ ~ 


\span(A) 


Figure 3.35 The typical illustration of a linear least squares problem. The objective is to find a 
vector Ax that is as close as possible to b in the minimum squared error sense. The resulting 
vector is the orthogonal projection of b on the space spanned by the columns of A and is given by 
Ax; 5 = A(A*A) [A*b. 


The least squares solution of a linear system is useful in many contexts in signal processing and 
communications. It can be used as an estimator (discussed further in Section 3.5.3) and as a way to 
choose parameters of a model. Least squares solutions also have deep connections with adaptive 
signal processing [172, 143, 294]. The least squares solution can be found using recursive least 
squares, where the least squares solution is updated as additional rows are added to A and additional 
entries in b. Least squares 1s also related with linear minimum mean squared error (LMMSE) 
estimation (discussed further in Section 3.5.4). 


An application of least squares is provided in Example 3.46. In this example, the coefficients of an 
affine signal model are selected. This example shows how to find the unknowns with and without 
linear algebra. Applications of least squares to wireless communications are found in Chapter 5. 


Example 3.46 
Consider a system that provides an output y[n] for an input x[n] at time n, where the input-output 
relationship is believed to have the following relationship: 

y(n] = 89 + fy2[n] (3.427) 


where Bp, and B, are unknown coefficients to be estimated. Let e[n] = y[n] — By — B,x[n]. Suppose that 
there are N > 3 observations y[0], y[ 1], ..., y[N— 1] corresponding to N known inputs x[0], x[1], ... 
eV 1]. 
* Assuming that B, = 0, find the least squares estimation of By by expanding the sum, 
differentiating, and solving. 


Answer: The sum squares error is 


N-1 
yo le[n]|? = >. ly[n] — Bol? (3.428) 


n=0 n=0 
N-1 
= pa (y*[n] — 89)(y[n] — Bo) (3.429) 
n=0 
N-1 N-1 
- :3 y[n]? — (s y" nl) Bo — b a) Bo +N8oPo- (3.430) 
n=0 n=0 n=0 


Taking the derivative of the sum squares error over 96 and setting the result to zero, we obtain 
the least squares estimation of B, as follows, 


By = Lenao_vln) (3.431) 
N 
which is the mean of the observations y[7]. 
¢ Assuming that By = 0, find the linear least squares estimate of 8; by expanding the sum, 
differentiating, and solving. 
Answer: The sum squares error is 


N-1 
= le[n} = ly[n] — B,2[n]|? (3.432) 
n=0 n=0 
N-1 
= = (y*[n] — 2*[n]G7)(y[n] — 2iz[n]) (3.433) 
n=0 
N-1 N-1 
=> yn)? - (s- vino) B1- 
n=0 n=0 
N-1 
(5: vi y(n]: rl) By + » |2z[n] "| G7 Bi. (3.434) 
n=0 n=0 


Taking the derivative of the sum squares error over 97 and setting the result to zero, we obtain the 
least squares estimation of B, as follows: 


N-1 
3 _ : eal y|n|ax*|n) 
pal == N-1 _ 
) ae |a[n]|? 


¢ Find the linear least squares estimation of By) and B, by expanding the sum, differentiating, and 


(3.435) 


solving. 
Answer: The sum squares error is 


N-1 


N=1 
», le[n]|? = >. ly[n] — Bo — Bi2[n]|? (3.436) 


n=0 n=0 
N-1 
= 3. (y*[n] — 85 — x*[n]G7)(y[n] — Bo — B12[n]) (3.437) 
n=0 
N-1 N-1 
— bs y*|n](y[n] — Bo — G1 sn) - p» (y[n] — 80 — Biz[n])} Bo- 
n=0 n=0 


N-1 
- bs x*{n|(y[n] — Bo — ast) a (3.438) 


Taking the derivative of the sum squares error over 96 and setting the result to zero, we obtain 
N-1 
S— (yl[n] — 60 — 61x[n]) =0. (3.439) 


n=0 


Taking the derivative of the sum squares error over ST and setting the result to zero, we obtain 
ye x*[n}(y[n] — Go — Ba[n]) =0. (3.440) 


Solving the linear equations (3.439) and (3.440) gives us the least squares estimation of By and 
B, as follows: 


= (Sno e*tnl) (Xnco' wll) ~ (Saco ated) (NnKo 2*lelyln) 
- N (SNp a(n) — (SMG e*tnl) (SONG ain)) 
N (Np 2" Inlyln}) -— (ORS 2*In)) (OATS ale) 

N (Sonco zln]) — (azo e*Inl) (Ao ain) 


* Use the least squares estimation method to find the coefficients By and /, based on the known 
inputs and their corresponding observations using the matrix form. 


(3.441) 


A= (3.442) 


Answer: In matrix form, the given model can be written as 


y(0) 1 x0] 


y(1] 1 z{1] 8 
= |. i, (3.443) 
Fi ‘ Dy 
y|N — 1] l avV= || ¢€ 
Nec sommes? Nee eee” 
y x 
The sum squares error is given by 
N-1 
b) = a. ly[n] — Bo — B12[n]|? (3.444) 
n=0 
=ly — Xb)’. (3.445) 
The formulation of the least squares problem for estimating By and B, 1s 
brs = arg min J(b). (3.446) 


bec? 


¢ Since NV > 3, then X is tall and is likely to be full rank. Assuming this is the case, the least 
squares solution to this problem is given by 


brs = (X*X) 'X"*y. (3.447) 


Because the matrices are 2 x 2, itis possible to simplify the answer even further: 


i = N— 
bis = | wot Ynao an Pero ule (3.448) 
5 |okatet ia] SNF alnjetin]} [ACE ot inlylnl 
- 1 
NY neo eln|a*[n] — | On elnll? 
nao elnja*[n] —onc9 efn]] | Yn ln 5s 
x a ee ss N— 3 (3.449) 
_ aa wT [n] N pala =o 2 *In|y lr v| 
The squared error given by the least squares solution is 
J(bus) = y*y — yy’ Xbys (3.450) 


N-1 N-1 


= » ly[n]|? — Bo dX y(n] — 31 y"[n]x[nJ. (3.451) 
n=0 


n=0 


3.5.3 Maximum Likelihood Parameter Estimation in AWGN 


The least squares solution is used extensively in this book as a way to estimate parameters when the 
observations are corrupted by noise. In this section we establish that under some assumptions, the 
least squares solution is also the maximum likelihood solution. This means that it performs well as an 
estimator. 


Consider a system of equations where the observations are corrupted by additive noise: 


y = Ax+ Vv. (3.452) 


We call y the N x 1 observation vector, A the NV x M data matrix, x the M@ x 1 vector of unknowns, and 
vis anN x | random vector called additive noise. The objective 1s to estimate x given an observation 
y and knowledge of A but with only statistical knowledge of v—the instantaneous value of the noise 
is unknown. This is an estimation problem. 


Now we focus on the special case where everything is complex and the additive noise is circularly 


symmetric complex Gaussian distributed with distribution Ne(0,0°T) th this case the system is said 
to have AWGN. 


There are different objective functions that can be used when performing estimation. A common 
estimator in signal processing and communications is based on the maximum likelihood objective 
function. The likelihood function corresponding to (3.452) is the conditional distribution of y given A 
and supposing that x the unknown takes the value X. Since v is circularly symmetric complex 
Gaussian, given AX, then y is also circularly symmetric complex Gaussian with mean 


Eyja.x[y|A, x] = AX (3.453) 
and covariance 
Eyja.x((y — AX)(y — AX)*] = 071. (3.454) 
Therefore, the likelihood function is 
Fytax(V1A,®) = Spe A8 OAM), (3.458) 
Taking derivatives with respect to X 
~ fytax(¥lA,2) = = Aty — Ax)e~ 9~ AX) AR) (3.456) 


ax® 
and setting the result equal to zero gives 
A*(y — Ax) = 0, (3.457) 
which is exactly the orthogonality condition in (3.419). Assuming that V > M and A is full rank, 
xmi = (A*A)~'A*y. (3.458) 


The main conclusion is that linear least squares is also the maximum likelihood estimator when the 


input and output are linearly related and the observations are perturbed by AWGN. The maximum 
likelihood estimator is a good estimator in the sense that as the number of observations grows large, it 
converges in probability to its true value [175]. In this case it is also the best linear unbiased 
estimator. As a result, the specialization to least squares estimators in this book is not a major 
limitation, and the algorithms presented in subsequent chapters will be useful in practical wireless 
systems. 


3.5.4 Linear Minimum Mean Squared Error Estimation 


Another objective function that is widely used in statistical signal processing is minimizing the mean 
squared error [172, 143, 294]. It is perhaps more widely used than maximizing the likelihood because 
it requires fewer statistical assumptions. This section reviews the linear minimum mean squared error 
(MMSE) estimator. There are many different variations of the MMSE, but we focus specifically on 
the linear case, dropping the term /inear unless there is possibility for confusion. We consider a 
vector case that allows us to relate the results to the least squares solution and also has applications 
to equalization in MIMO communication systems in Chapter 6. Most of the results using least squares 
estimators can be modified with some additional terms to become MMSE estimators. 


Suppose that there is an unknown M x 1| vector x with zero mean and covariance C,, and an 


observation vector y with zero mean and covariance Cy 


on, Ee : 
with correlation matrix Cyx = Elyx"] Both x and y are assumed to be zero mean, which can be 
relaxed with some additional notation. The objective of the linear MMSE estimator is to determine a 
linear transformation such that 


Guise = arg minE [|x — G*y||?]. (3.459) 
z 


. The vectors x and y are jointly correlated 


Let x,, = [x], and g,, =[G].,,, Then 


M 
Gamsr = arg minE ps [Xm — Ca? ; (3.460) 
G 


m=1 


Now we focus on solving for one column of Gyyysp. Interchanging the expectation and derivative, 


M M 
d —_ d is — 
E | ) Xm — cov? =E | ) Xm — co? (3.461) 


dgy. m=1 dg; m=1 
d 4c 9 - 2) 
= E | = [Xk — Sky! (3.462) 
dgy 
= Ely (y*g, —x*)). (3.463) 


Now taking the expectation and setting the result to zero leads to the MMSE orthogonality equation 
Cyy 8k = [Cyx]:,k: (3.464) 


Solving gives 


Ek,MMSE = Cyy[Cyx]:,x- (3.465) 


Now reassembling the columns of G and combining the results gives the key result 


Gass = CyyCyx. (3.466) 
Based on (3.466), the MMSE estimate of x is then 
XMMSE = Gyumspy (3.467) 
= Cy.CY: (3.468) 
The effect of Cyy is to decorrelate y, while Cyx = Cxy is to exploit the joint correlation to 


extract information about x from y. 
The performance of the MMSE estimator is characterized by its MSE: 
M M 


¥. E [|Xm — Sm,mMseY|"] = E [|XmXm — Sm,MMSEYXm- | (3.469) 
m=1 m=1 
M 
= a [Crcse}anam _ Sin. MMSE|Cyxl]:,m (3.470) 
m=1 
M 
= >, [Cxx]m,m - [oe oe [Cyx]:,m (3.471) 
m=1 
= tr[Cxx] — tr [Cy,,Cl Cys] (3.472) 


where the final simplification comes from the tr[ A] operation that sums the elements of A. 


Now consider the model in (3.452) where A is known, x is zero mean with covariance C,,, v is 


zero mean with covariance C,,,, and x and v are uncorrelated. Then the covariance of y is 


Cyy = Elyy*| (3.473) 
= E[Axx* A*] + E[vv*] (3.474) 
= AC,,A* + Cyy (3.475) 
and the cross-covariance is 
Cyx = E [yx*] (3.476) 
= E[Axx* + vx"] (3.477) 
pa (3.478) 


Putting everything together in (3.468): 
XMMSE = CxxA*(AC,,A* + Cyy)7"y. (3.479) 


In the special case where C,,, = 07I (the entries of v are uncorrelated) and C,, = y7I (the entries of x 
are uncorrelated), then 


XMMSE = 7° A*(7y7AA* + 07I) “ty (3.480) 
o? =e 
= A* (aa: + 71) y. (3.481) 


In this case, the MMSE estimate can be viewed as a regularized version of the least squares estimate 
or maximum likelihood estimate in (3.458). The regularization has the effect of improving the 
invertibility of A. The ratio of y7/o? corresponds to what is usually called the signal-to-noise ratio 
(SNR) (where x is the signal and v is the noise). If the SNR 1s large, then 
5 \-1 
A*(AAT+ 51) SA*(AA*) | | 
. Tp ( , which is the least squares solution. If the SNR is small, 
2 \-!l 3 
A*(AAT+ 31) > A*Z 


then 7”, which is known as a matched filter. 


One major question in the application of MMSE techniques in practical systems is how to obtain 
the correlation matrices Cy, and C,,. There are two common approaches. The first is to make 
assumptions about the model that relates y and x and the underlying statistics, for example, assuming 
an AWGN model as in (3.452). Another alternative is to work with random processes and to use 
ergodicity to estimate the correlation functions. An application of this approach is illustrated in 
Example 3.47. 


Example 3.47 
Consider an input-output relationship 


y[n] = hs[n] + v{[n] (3.482) 


where s[m] is a zero-mean WSS random process with correlation r,,[”], v[”] is a zero-mean WSS 
random process with correlation r,,,[”], and s[m] and v[n] are uncorrelated. Assume that the scalar h 
is known perfectly. 
Suppose that we want to find the estimate of s[”] obtained from gy[] such that the mean squared 
error is mimimized: 
Ele[n]|?_ = Els{n] — gy[n]|?. (3.483) 


¢ Find an equation for g. First expand the absolute value, then take the derivative with respect to 
g* and set the result equal to zero. You can assume you can interchange expectation and 
differentiation. This gives the orthogonality equation. 


Answer: Since s[n] and v[n] are zero-mean random processes, that is, m, = 0 and m, = 0, the 
mean of y[7] 1s m, = hm, + m, = 0. Substituting the expression of y[7] into the expression of 


E{le[{n]|?] and expanding, 


E [le[n]|*] = E [|s[n] — gy[n|*] (3.484) 
= E[(s[n]s*[n] — 2s[n]y*[n]g* + gy[n|y*[n}g*)) . (3.485) 
Taking the derivative with respect to g* and setting it to zero, we obtain the following equation: 
E |(s{n] — gy|[n])y*[n]] = 0. (3.486) 
Using correlations and setting equal to zero, 
Psy|[O] = gryyl0}, (3.487) 
which is the orthogonality condition. 


¢ Simplify to get an equation for g. This is the MMSE estimator. 
Answer: Simplifying the orthogonality condition gives the MMSE estimator: 


JMMSE = Tyy [O]rsy[0] (3.488) 


(hh*rss(0] + Tvv[0])~*1s5[O]h*. (3.489) 


¢ Find an equation for the mean squared error (substitute your estimator in and compute the 
expectation). 
Answer: The mean squared error corresponding to the MMSE estimator is given by 
E [le[n||?] = (1 — gumsph)rss[0] (3.490) 
= (1 — (hh*rg5[0] + rov[0])~'7s5[0]h*h) r55[0]. (3.491) 
* Suppose that you know r,,[n] and you can estimate r,, [7] from the received data. Show how to 
find r,,.[7] fromr,,[n] and r,,[7]. 


Answer: Since s[n] and v[n] are uncorrelated and m, = m, = m, = 0, we have 
Tyy(n] = hh*res|n] + Torn}, (3.492) 


thus 
Tov [n] = ryy[nr] — hh*r,,[n]. (3.493) 


* Suppose that you estimate r,,.[n] through sample averaging of NV samples exploiting the 
ergodicity of the process. Rewrite the equation for g using this functional form. 


s\N-1 dle ee 0 be 
Answer: Assume the samples {y[7] }n=0 correspond to the input { s[n] iia, which is also 
known. An estimate of the autocorrelation of the input is 


N—1 274 
Pss 0] —_ paid a [n] (3.494) 


An estimate of the autocorrelation of the output is 


=i. * |, 
ryy|0] = 2anz0 uae in) (3.495) 


Therefore, the MMSE estimator corresponding to the samples is given by 
GMMSE = Tyy [O]rss[0]h* (3.496) 


N-1 * f 

Xn=o Inls"nl 5. (3.497) 

ae y[n|y*|n] 
N-1. ok 

2enzo S{ny"ln} (3.498) 

Sen=o Ylnly*(n] 


where (3.497) follows from replacing r,,[0] and r,[00] with their sample estimates and (3.498) 
follows from r,,[0] =7,,[0]/* and replacing with iB sample average. 


~ 


lo) N—-1 wl N-1 __. 
* Now consider the least squares solution where you know {s[n]}n—o and {yl"|}n—o . Write the 
least squares solution gy 5. Explain how the least squares and the linear MMSE are related in 


this case. 
Answer: The least squares solution is given by 


N-1 ‘ 
pear s{n|y [7] 2 AQC 
ae (3.499) 
en=o Yln|y* [nl] 
Note that LS = JMMSE. The explanation is that exploiting ergodicity in the linear MMSE— 


that is, taking no advantage of knowledge of the statistics of the random variables—does not 
give better results than the LS. 


gLs = 


Example 3.48 
Consider an input-output relationship 


yk = H {k\s[k] + vk] (3.500) 


where y[k] is the observed signal, H[A] is a known scaling factor, s[k] 1s an IID sequence with zero 


mean and variance 1, and v[k] is additive white Gaussian noise with distribution N (0, 0”). Suppose 
that s[A] and v[A] are independent. 


¢ Determine the least squares estimate of s[A]. 
Answer: This is the usual least squares problem with | x 1 matrices. Thus 
Sis[k] = (H*[k)H[k])~'H*[kly[k] = A[k)~ty[k) , which follows because we can 
divide through and cancel H*[k] assuming it is nonzero. 

¢ Determine the linear MMSE estimate of s[]. In other words, solve the problem 


Sumse[k] = GYmselklylAl (3.501) 
where 


Guse|k] = arg min E||g*y(k] — s[k]|?). (3.502) 
g 


Answer: The formulated problem is just a variation of (3.459). The solution is 


Gumse|k] = Se /ys (3.503) 
= (|A[k]|? + 0?)-' Ak] (3.504) 
H{k se 
= , 3.505 
ARI + i 
Then 
” H*|k} 
SmMsE|k| = =p KI 3.506 
SMMSE|| ee + saul | (3.506) 


3.6 Summary 


¢ Convolution in the time domain corresponds to multiplication in the frequency domain for the 
CTFT and DTFT. For the DFT, that convolution is periodic. 


¢ According to the Nyquist sampling theorem, a continuous-time signal can be completely 
represented from its periodically spaced samples if the sampling period is less than one-half 
the maximum frequency in the signal. Aliasing results if the sample period is not small enough. 

¢ There are many ways to measure the bandwidth of a signal, beyond just absolute bandwidth. 
The bandwidth of a WSS random process is defined based on the Fourier transform of its 
covariance function. 


¢ Continuous-time LTI systems that process bandlimited signals can be replaced by a continuous- 
to-discrete converter, a discrete-time LTI system, and a discrete-to-continuous converter. 


¢ WSS random processes are fully characterized by their mean and correlation function. Sample 
averages are used to compute the mean and correlation function of an ergodic WSS random 
process. 


¢ Passband signals are associated with an equivalent complex baseband signal and a complex 
sinusoid that depends on the carrier f,. Upconversion creates a passband signal froma complex 


baseband signal. Downconversion creates a complex baseband signal froma passband signal. 


¢ When an LTT system operates on a passband signal, only the part of the system in the bandwidth 
of the passband signal is important. An equivalent discrete-time input-output relationship can be 
written based on complex baseband inputs, outputs, and an equivalent channel. Communication 
of passband signals through LTI systems is therefore fundamentally related to discrete-time 
signal processing. 

¢ Upsampling and downsampling are used to change the rate of discrete-time signals. Filtering, 
upsampling, and downsampling can be interchanged with suitable modifications to the filters. 


¢ The linear least squares estimator is also the maximum likelihood estimator for linear models 
with AWGN. 


Problems 


1. There are many occasions to consider the CTFT and the DTFT in wireless communication 
systems. You should be able to solve the following problems using the appropriate tables and 
transform properties. They can all be solved without computing any integrals or sums. Use x(t), 


x(f), x(e?"), and x[n] as appropriate when you are not given the explicit function. Use t for the 
time variable and n for the sequence index. For example, the CTFT of x(t — t) is (Je ?. 


(a) CTFT of x(the?nvot 
(b) CTFT of x()y(2) 


T rot | 
(c) CTFT oe pern h (7 )ax(t — T)dr 


(d) CTFT of 2?" forrect ( T), where f and T are constant 

(e) Inverse CTFT of x(f) + x(—f). How does this simplify if x(t) is real? 
(f) Inverse CTFT of x(af), where a > 1 

(g) DTFT of x[n]e?*/o" 

(h) DTFT of x[n]y[7] 

(i) DTFT ee eae h|{mja[n — m|] 

(j) Inverse DTFT of x(e?"*) cos(2n7;, ot) 

(k) Inverse CTFT of x(f)e?*/0 

(1) CTFT of x(t) * v(t) 

(m) CTFT of x(at), where a > 1 


(n) CTFT of x(t) cos(27f,t) 

(0) DTFT of x(n) cos(2mf,n) cos(2mf>n), where f, and f, are constant 

(p) CTFT of sin(27/,t)x(t — ¢,), where f, and f, are constant 

(q) DTFT of e?*/1"(x[n] + ax[n — 1]), where f; and a are constant 

(r) CTFT of cos(2af,)x(20) + sin(2af,t)x(2t + 1), where f; is constant 

(s) Inverse CTFT of tX(F — fr) + 5 x(f) + ax(f + fi ) where /; is constant 
(t) Inverse DTFT of x(e?"/)) — x *(e?/)eP"/"1, where f; and n, are constant 
(u) CTFT of e?/i4x(¢ — t,), where f; and ¢, are constant 

(v) CTFT of cos(2mf,t)x(2t) + sin(2af,t)x(2t + 1), where f; is constant 

(w) DTFT of cos(2af,n)x[n — 1], where f, is constant 

(x) Inverse CTFT of x(f—f,) + x*(Je?"", where f; and t, are constant 

(y) Inverse DTFT of x(e”/) cos(27f/ng), where np is constant 


2. Let x[n] be a discrete-time signal with the discrete-time Fourier transform (DTFT), (e?”). 
We define another signal as follows: 
(el*"aIn y+ain 
vn) = Mala) + 2} 


(3.507) 


(a) Determine the DTFT Ye?) of y[n] in terms of x(e?/). 
(b) Show that the DTFT of y[27] is equal to y(e”). 
3. Evaluate the Fourier transform of the damped sinusoid wave g(t) = e ‘ sin(2af,4)u(t) where 
u(t) is the unit step function. 
4. Verify the following properties of the CTFT and the DTFT. 
(a) Time shifting: 


alt= 7) re“ P"I7 x(f) aaa 

a[n — k]  eS27 FF y (ei? F) (3.509) 
(b) Frequency shifting: 

eT (t) os x(f — fo) (3.510) 

exited) a (el IM) (3.511) 


5. Let x(t) be a complex signal with bandwidth W and Fourier transform <(/). 
(a) Show that the Fourier transform of x*(t) is x*(—/). 


ee a ai 
(b) Show that the Fourier transform of real “(4) = 3 (x(f) + x*(—f)). 


(c) Find the Fourier transform of x(t) * x(Z). 

(d) Find the Fourier transform of x(t) * x*(t). 

(e) Find the Fourier transform of x(t) * x*(-1). 
6. Sketch the following sequence. 


dit= 1 :|n| <4,neven (3.512) 


0) : otherwise. 


and its corresponding DTFT (ezf). Show your work. Hint: You may need to use a 
combination of Fourier transform properties. 


7. Let x[n] be a discrete-time signal with a Fourier transform that is zero for 1/8 < |f| 1/2. Prove 
that 


niceeee > ~ aati sin(4(n — 4k)) sonia 


k=—oc 


8. The output y(t) of a causal LTI system is related to the input x(t) by the following: 


eS) + 1ly(t) = / | z(7r)z(t — r)dr — z(t), (3.514) 


‘> ©) 


where z(t) =e 7“ +5 (2). 
(a) Find the frequency response h(f) = y(f)/x(/) of this system. 
(b) Find the impulse response of this system. 

9. The even part of a real sequence x[m] is defined by 


Te\n| = z|n] are. (3.515) 


Suppose that x[7] 1s a real finite-length sequence defined such that x[”] = 0 for n <0 and n> N. 
Let X[k] denote the N-point DFT of x[7]. 


(a) Is ReLX[A]] the DFT of x,[n]? 
(b) What is the inverse DFT of ReLX[A]] in terms of x[n]? 


10. Let x[n] be a length NV sequence with X[k] denoting its N-point DFT. We represent the DFT 
operation as X[k] = F{x[n]}. Determine the sequence y[n] obtained by applying the DFT 
operation six times to x[7], that is, 


y(n] = FL FL FL FL FF {a[n| yyy. (3.516) 


11. Show that the circular convolution is commutative. 

12. Let g[n] and h[n] be two finite-length sequences of length 7 each. Ify,[n] and y-[n] denote the 
linear and 7-point circular convolutions of g[n] and h[n], respectively, express y-[n] in terms 
of y;[n]. 


13. The Fast Fourier Transform (FFT) Suppose that a computer program is available for 
computing the DFT: 


N=-1 
X[k] = y; a[njeI@n/N)kn ee | ec ae ees (3.517) 
n=0 


that 1s, the input to the program is the sequence x[n] and the output is the DFT X[k]. Show how 
the input and/or output sequences may be rearranged such that the program can also be used to 
compute the inverse DFT: 
, N= | 
en = N os: X [kiran xn =0,1,---,N—-1; (3.518) 
k=0 
that 1s, the input to the program should be X[k] or a sequence simply related to X[k], and the 
output should be either x[7] or a sequence simply related to x[n]. There are a couple possible 
approaches. 
(a) Compute the inverse DFT using the modulo property of X[A] and x[n]. 


(b) Compute the inverse DFT using the complex conjugate of X[k]. 
re. ert... rkn Tkn _ ,—j(27/N)kn 
14. Given the N-point DFT X[k] = din=o0 tln|Wy : Wr =e ere , answer the 
following questions: 
(a) Break it into four N/4-point DFTs in preparation for a radix-4 decimation-in-frequency FFT 
algorithm. Simplify the expressions as much as you can. 
(b) Compare the complexities of direct DFT computation and the suggested decimation-in- 
frequency FFT algorithm in (a). 
15. Computer This problem explores the impact of zero padding on resolution. 
(a) Construct the 128-point sequence x[] to be a cosine wave of 0.125 cycles per sample plus 


0.5 times a sine wave of 0.25 cycles per sample. Use the /ft function in MATLAB to 
compute the DFT of x[n]. Plot the magnitude of the DFT. 


(b) Pad 896 zeros at the end of the 128-point sequence and use the /ft function to compute the 
1024-point DFT of the zero-padded signal. Does the 1024-point DFT look the same as the 
128-point DFT of part (a)? If not, how is it different? 

(c) Now find the 1024-point DFT of a pulse sequence with width 128 (x[n] = 1 for n =0, 1, - - 
-, 127 and 0 otherwise). How would this DFT explain the differences between (a) and (b)? 

16. Consider the continuous-time signal 


a(t) =cos(l0mt) + sin(3at + 7/2). (3.519) 
Let T be the sampling period and reconstruction period. Let x,(t) be x(¢)s(t) where s(t) = >); o(¢ 
— kT). We uniformly sample x(t) to form x[n] = x(n7). 


(a) What is the largest value of 7 that we can use to sample the signal and still achieve perfect 
reconstruction? 


(b) Suppose we sample with 7'= 1/30. Please illustrate <(f), <,(f), and x(e?"/) in the interval 
—1/2T to 1/2T. Do not forget to label correctly. 
(c) What is x[n] for T= 1/30? 
(d) Suppose we sample with 7= 1/10. Please illustrate <(f), <,(f), and x(e?"/) in the interval 
—1/2T to 1/2T. Do not forget to label correctly. 
(e) What is x[n] for T= 1/10? 
17. Suppose that we wish to create a signal 


a(t) = 4 cos(2710°t) + 3cos(4710°t). (3.520) 


(a) What is the Nyquist frequency of x(t)? 

(b) What is the Nyquist rate of x(t)? 

(c) Suppose that you generate x(t) using a discrete-to-continuous converter operating at three 
times the Nyquist rate. What function x[1] do you need to input into the discrete-to- 
continuous converter to generate x(t)? 

18. Compute the Nyquist frequency and the Nyquist rate of the following signals: 


(a) x(t) = cos(221072) 


sin({ L007t) 


(b) x(t) = Cn cos(27107t) 


| 
( ) TY ~ ioe cos(2710°t) (Gg ) sin(27107t) 
Cc iat al . . JU \ 


19. Suppose that we wish to create a signal 


sin(1007t) 


a(t) = cos(2710°t as 
aa \~Fo00nt 


(a) Compute the Fourier transform of x(Z). 
(b) Sketch the magnitude of <(f). What is interesting about the spectrum? 
(c) What is the Nyquist frequency of x(t) denoted fy? 


(d) What is the Nyquist rate of x(t)? 
(e) Suppose that x(¢) is sampled with sampling rate 3f,/4. Sketch the spectrum of x(e?”/). 


Suppose that x(t) is sampled with sampling rate 3f,. Sketch the spectrum of x(ez/). 
(f) Supp mip mpling N p 


(g) Suppose that we want to generate x(t) using a discrete-to-continuous converter operating at 
two times the Nyquist rate. What function x[n] do you need to input into the discrete-to- 
continuous converter to generate x(t)? 

20. Sampling and Quantization A discrete-time signal is generated from a continuous-time 
signal through sampling. In a digital communication system, this discrete-time signal is then 
quantized in order to generate a digital signal, which 1s then transmitted over a link or channel. 
If the analog signal is sampled at rate f, and passed through a quantizer with Q levels, the 
digital signal at the output of the quantizer has rate f,log,(Q) bits/sec. Now consider the 


following problem: 
A digital communication link carries binary-encoded words representing samples of an input 
signal 
x(t) = 3cos(6007t) + 2 cos(1800zt). (3.522) 


The link is operated at 10, 000 bits/sec, and each input sample is quantized into 1024 levels. 
(a) What is the sampling frequency that was used in this system? 
(b) What is the Nyquist rate? 


21. Let x(t) be a real-valued continuous-time signal with maximum frequency 40Hz. Also, let y(t) 
= x(t — 1/160). 


° Ifx[n] =x(n/80), is it possible to recover x(t) from x[n]? Why or why not? 
° If y[n] =y(n/80), is it possible to recover y(t) from y[n]? Why or why not? 


¢ Is it possible to find y[n] from x[”] without performing any upsampling or downsampling 
operations by using an LTI system with frequency response h(f)? Find h(/). 


22. Consider the systems shown in Figure 3.36. Assume that h,(e?”) is fixed and known. 
Determine h,(e?"/), which is the transfer function of an LTI system, such that Vifn] =y[n] 1 the 
inputs to hy (eat ) and hy(e?at ) are the same. 

4) —> ye 


x(n] ——>(t4) 


x(n] 


Figure 3.36 Two equivalent LTI systems 


y|n| 


23. Consider Figure 3.37 where x[n] =x(nT) and y[n] = y[6n]. 


Figure 3.37 Interconnected continuous-to-discrete and discrete-to-continuous converters 
(a) Assume that x(t) has a Fourier transform such that <(/) = 0 for |f| > 300. What value of 7 is 
necessary such that x(e?") = 0 for 1/4 <|ff/< 1/2? 
(b) How should 7 be chosen such that y(t) = x(t)? 
24. Consider the following continuous-time signal: 


x(t)= S~ cos(0.6krt). (3.523) 
k=—-5 
Let x,(¢) be x(t) IIL,(t), where I= Y, 8(¢ — £7). 


(a) What is the largest value of T that we can use to sample the signal and still achieve perfect 
reconstruction? 

(b) Suppose we sample with 7'= 1/2. Please illustrate |x, (f)| in the period —1/2T to 1/2T. Do 
not forget to label correctly. 

(c) Suppose we sample with 7’ = 1/10. Please illustrate |,(7)| in the period —z/T to x/T. Do not 
forget to label correctly. 

25. Consider a random discrete-time signal x[”] = s[n]+e[n], where both s[] and e[n] are 
independent zero-mean WSS random processes. Suppose s[] has autocorrelation function 
R,,[n] and that e[n] is white noise with variance o”. Determine the mean and autocorrelation 
function of x[7]. 

26. Let x[n] and y[n] be two independent zero-mean WSS random signals with autocorrelations 
R,,[n] and R,,,[n], respectively. Consider the random signal v[n] obtained by a linear 


combination of x[”] and y[”], that is, v[m] = ax[n] + by[n], where a and 6 are constants. 
Express the autocorrelation and cross-correlations, R,,,.[”], R,,[7], and &,,[7], in terms of 


R,,[n] and R,, [7]. 


27. Let x[n] and y[n] be two independent zero-mean complex WSS random signals with 
autocorrelations R,,[n] and R,,,[”], respectively. Determine the mean and autocorrelation 
functions of the following functions: 


(a) x[n] + y[n] 
(b) x[n] — y[n] 
(c) x[n]y[7] 
(d) x[n] + y*[7] 
28. Suppose that s[”] is a complex WSS random process with mean m, and covariance C,,[k]. 
Find the mean and covariance for x[n] defined as 


29. Let x[n] be a random process generated by 
z|n] = az|[n — 1] + w[n] (3.525) 
where n > 0, x[-1] = 0, and w[7] is an IID N (0, 07,) process. Clearly x[n] is zero mean. 
(a) Find r,,[n, n+ k]. 
(b) If |a| < 1, show that asymptotically the process is WSS; that is, for = 0, show that lim,,_,., 


r,,[n, n + k] is only a function of k. 
30. Let x[n] be an IID discrete-time, discrete-valued random process such that x[”] = 1 with 

probability p and x[m] =—1 with probability 1 — p. 

(a) Write a general equation for computing the mean m,. of a discrete-valued IID random 
process. 

(b) Write a general equation for computing the covariance C\,[€] of a discrete-valued IID 
random process. 

(c) Determine the mean of m, of the random process given in this problem. 


(d) Determine the covariance C,,[¢] of the random process given in this problem. 


2 
31. Suppose that v[7] is an IID complex Gaussian random process with zero mean and variance 7» 
. Let a and b denote constants. Suppose that w[n] is an IID complex Gaussian random process 
} 


with zero mean and variance 7w. v[n] and w[n] are independent. 
(a) Compute the mean of y[”] = aw[n] + bw[n — 2] + v[n]. 
(b) Compute the correlation of w[7]. 
(c) Compute the correlation of v[7]. 
(d) Compute the correlation of y[7]. 
(e) Compute the covariance of y[n]. 
(f) Is y[n] wide-sense stationary? Please justify. 
(g) Is y[n — 4] wide-sense stationary? Please justify. 
(h) How can you estimate the correlation of y[n] given samples y[0], y[1], ..., v[99]? 
32. Suppose that w[n] is an IID complex Gaussian random process with zero mean and variance 
os Let a, b, and c denote constants. 
(a) Compute the correlation of w[7]. 
(b) Compute the mean of y[”] = aw[n] + bw[n — 2]. 
(c) Compute the correlation and covariance of y[7]. 
(d) Is y[n] wide-sense stationary? Please justify. 
(e) Is y[m — 4] wide-sense stationary? Please justify. 
(f) Let v[m] be a random process generated by v[n] = co[n—1]+w[n], where n = 0 and v[-1] = 
0. Compute the correlation of v[7]; that is, compute r,,,[n, n + k]. 
(g) If |c|< 1, prove that lim,_,.,7 7),[”, 1 + k] 1s only a function of k for k = 0. 


33. Toss a fair coin at eachn, —] <n <oo 1, and let 


) 


in] +S, if heads is the outcome, P(H) = ; (3.526) 
wi|n| = f x " 3.526 
—S, if tails is the outcome, P(T) = ; | | 


where S'> 0. w[n] is an IID random process. Now define a new random process x[n], n = 1 as 


x{n| = ? w|i]. (3.529) 


Find the mean and covariance function of the process x[n], whichis known as a random walk. 
Is this process wide-sense stationary? Random walks are often used to model node location in 
simulations and analysis of mobile users in wireless system performance. 


34. Consider the following received discrete-time random process defined by the equation 


y(n] = 2[n] — v{n] (3.530) 


where x[n] and v[] are independent WSS random processes with means p, and p,, and 
autocovariance C\,[t] = (1/2) ‘ and C,,,[t] = (-1/4) *, respectively. 

(a) What is the mean of y[n]? 

(b) What is the autocorrelation of y[”]? 

(c) What is the autocovariance of y[n]? 

(d) What is the power spectrum of y[7]? 


(ec) Assuming that x[7] is the “signal” and — v[7] is the “noise,” calculate the signal-to-noise 
ratio. Remember, this is the ratio of the variance of the signal to the variance of the noise. 


35. Consider a continuous-time zero-mean WSS random process x(¢) with covariance function 
Cyx(t) = eF. 
(a) Compute the power spectrum P,,,(f) of x(¢). 
(b) Determine the half-power bandwidth (or 3dB bandwidth) of the signal. 
(c) Determine the sampling period 7 such that you sample the signal at twice the 3dB frequency. 
(d) Compute the covariance function of x[n] =x(n7). 
(ec) Compute the power spectrum Perv ) of the sampled signal. 
(f) Is information lost in going from continuous to discrete time in this case? 


36. Consider again the same setup as problem 35, but now we use fractional containment to define 
bandwidth. 


(a) Determine the power spectral density P,(f/) of x(¢). 
(b) Compute the 3dB bandwidth of x(Z). 


(c) Compute the fractional power containment bandwidth with a = 0.9, that is, the bandwidth 
that contains 90% of the signal energy. 


(d) Find the sampling period 7 such that you sample x(t) at twice the 3dB frequency. 
(ec) Determine the covariance function of x[n] = x(n7). 


(f) Compute the power spectral density P(e?) of x[n]. 
37. Computer Consider a discrete-time IID complex Gaussian WSS random process with zero 
mean and unit 1, given by w[n]. Let x[n] = w[n] + 0.5w[n — 1] + 0.25w[n — 2]. Generate a 
realization of length 256 of x[m] in your favorite computer program. 
(a) Plot Mx from (3.195) for N=5, 10, 20, 50, 75, 100, 256 and explain the results. 


i 
(b) Plot Rzz{k] from (3.196) for k= 0, 1, 2 for the same values of N. Compare with the true 
value of R,.,[k] that you compute analytically. 
38. Consider a continuous-time system with input random process x(t) and output process y(t): 
(3.531) 


y(t) = : | x(t — s)ds. 


Assume that x(t) is wide-sense stationary with power spectrum P,(f) = 8 for —o0 < f'<co, 


(a) Find the power spectrum of the output P,(/). 
y. 


(b) Find the correlation of the output R,,,(Z). 
39. Consider a continuous-time WSS random process x(t) with correlation function C\(t) = el 


(a) Compute the power spectrum P,(f) of x(¢). 

(b) Determine the half-power bandwidth (or 3dB bandwidth) of the passband signal. 
40. Another important Fourier transform relationship is the Wiener-Khinchin theorem, which 
states that the power spectral density of a WSS random process is equal to the Fourier 
transform of the autocorrelation function. Consider a time-truncated function x(t, 7): 

gerade 


> 5 


ra 


x(t) 
: otherwise. 


ae y= 0 


The energy spectral density of x(¢, T) 1s given as 
See(f,T) = E[|X(f) 


4 i ( if 2) 


and the power spectral density is given as 
Sst 
fi 


Sex(f) = tim 


Sne(f) = F {Rez(T)} 


For this problem, prove the Wiener-Khinchin theorem. In other words, show that 


where F is the Fourier transform operator and 
Rz2(T) = E[x(t)2*(t — 7)| 


is the autocorrelation of a function x(f). 


41. Suppose that _Y is a complex normal random variable with distribution Ne(m, 0? ) 
(a) What is the mean of X? 
(b) What is the variance of X? 
(c) What is the mean of ReLX]? 
(d) What is the value of E [Re[X |Im|[X]} 
42. Consider the following passband signal: 


tp(t) = e “*u(t) cos(2 x 10°zt), (3.537) 
where a = 1000 and 
(t) 1 reo (3.538) 
u —— Oo. Dd 
? O fe<0. : , 


(a) Determine and plot |x,(f)|, where x,(f) is the CTFT of x,(¢). Do not forget to label correctly. 

(b) Find the complex baseband equivalent signal x(7) of x,(¢). Plot |x(/)|, where x(f) is the 
CTFT of x(t). Do not forget to label correctly. 

(c) Determine the absolute bandwidth of the passband signal. 

(d) Determine the 3dB bandwidth of the passband signal. 

(ec) Determine the 3dB bandwidth of the baseband signal. 


(f) Determine the Nyquist rate of the baseband signal, assuming we define the bandwidth by the 
3dB bandwidth. 


(g) Draw a block diagram of the entire system to create x,(¢), including D/C and upconversion. 


43. Consider a wireless communication system with the carrier frequency of f, = 2GHz and the 


absolute bandwidth of W= 1MHz. The propagation channel consists of a sum of attenuated 
reflections: 


99 
h.(t) = 5° (0.9)"6(t — n107"). (3.539) 
n=0 

(a) Determine the channel magnitude in the frequency domain, that is, |h,(f)| where h,(f) is the 
continuous-time Fourier transform of /,(¢). Do not forget to label correctly. 

(b) Find the passband channel /,(t) by applying an ideal bandpass filter p(t) with the center 
frequency at /, and the absolute bandwidth W to the channel /,(1). Sketch |h,(/)|, where h,(/) 
is the continuous-time Fourier transform of h,(¢). Do not forget to label correctly. 

(c) Find the complex baseband equivalent channel A(t). Sketch |h(/)|, where h(f) is the 
continuous-time Fourier transform of A(t). Do not forget to label correctly. 

(d) Find the pseudo-baseband equivalent channel /,,(¢). 


(ce) Find an equation for the discrete-time complex baseband equivalent channel h[n], assuming 


sampling at the Nyquist rate. 
44. Consider the following passband sinc? pulse signal: 
tp(t) = 2sinc?(2 x 10’t) cos (4.8 x 10° zt) , (3.540) 


sinc(a) = Sne# 


where 

(a) Determine and plot |x,(f)|, where x,(/) is the continuous-time Fourier transform of x,(¢). Do 
not forget to label correctly. 

(b) Find the complex baseband equivalent signal x(¢) of x,(¢). Plot |x(/)|, where <(f) is the 
continuous-time Fourier transform of x(t). Do not forget to label correctly. 

(c) Determine the absolute bandwidth of the passband signal. 

(d) Determine the half-power bandwidth (or 3dB bandwidth) of the passband signal. 

45. Consider the following passband sinc? pulse signal: 


7a 


p(t) = sinc?(3 x 10’t) cos (4 x 10°zt) , (3.541) 


sin(7a) 


Ta 


where Sinc(a) = 


(a) Determine and plot |x,(/)|, where x,(f) is the continuous-time Fourier transform of x,(¢). Do 
not forget to label correctly. 

(b) Find the complex baseband equivalent signal x(t) of x,(t). Plot |x(/)|, where x(f) is the 
continuous-time Fourier transform of x(t). Do not forget to label correctly. 

(c) Determine the absolute bandwidth of the passband signal. 

(d) Determine the absolute bandwidth of the baseband signal. 

(ec) Determine the Nyquist rate of the baseband signal. 


(f) Suppose we create x(t) using a discrete-time sequence x[n] with T chosen corresponding to 
twice the Nyquist rate. Determine x[77]. 


(g) Draw a block diagram of the entire system to create x,(¢), including D/C and upconversion. 


46. Let s(t) be a bandlimited signal such that s(f) = 0 for |< B < f,Hz. Also, let $(¢) be the Hilbert 
transform of s(t), where $\") is defined as 


px(t) = s(t) cos(27f.t) + s(t) sin(27 Sct), prove that x(t) is a single-sideband signal. 
47. Consider the following passband signal: 


yp(t) = V2 sinc(3t) cos(200zt). (3.543) 


(a) What is the carrier frequency of y,(t)? 


(b) Determine and plot |y,(/)|, where y,(f) 1s the continuous-time Fourier transform of y,(7). Do 


not forget to label correctly. 
(c) What is the absolute passband bandwidth of y,(¢)? 


(d) Find the complex baseband equivalent y(7) of y,(¢). 


(e) What is the absolute bandwidth of y(t)? 
(f) Draw a block diagram for downconversion for this system. 

48. Consider a wireless communication system with the carrier frequency of f, = 1900MHz and the 
absolute bandwidth of W = 500kHz. The two-path channel model is 


h(t) = 0.5 6(t — 2 x 107%) — 0.25 6(t —5 x 107). (3.544) 


(a) Determine and sketch the channel magnitude in both time and frequency domains, that is, 
|A(t)| and |h(f/)|, where h(/) is the continuous-time Fourier transform of h(t). Do not forget to 
label correctly. 


(b) Find the passband channel /,(¢) by applying an ideal bandpass filter p(¢) with the center 
frequency at /, and the absolute bandwidth W to the channel h(t). Sketch |h,(f)|, where h,(/) 
is the continuous-time Fourier transform of /,(¢). Do not forget to label correctly. 

(c) Find the complex baseband equivalent channel A(t). Sketch |h(/)|, where h(/) is the 
continuous-time Fourier transform of /(t). Do not forget to label correctly. 

(d) Find the pseudo-baseband equivalent channel /,,(¢). 

(ce) Find an equation for the discrete-time complex baseband equivalent channel h[n], assuming 
sampling at the Nyquist rate. Do not forget about the appropriate normalization factors. 

49. Consider a wireless communication system with the following channel model: 
r*, £20 


€ ; 
A(t) = 3545) 
Kf) 0 t<0. ais) 


(a) Determine the continuous-time frequency response of the channel and sketch its magnitude. 


(b) Suppose a baseband signal with bandwidth of B is input into a system with impulse 
response /(t). Determine the discrete-time equivalent system. 


50. Consider a wireless communication system with carrier frequency of f, = 2000MHz and 
absolute bandwidth of W= 1MHz. The channel model is exponential (common in practice): 


h.(t) = exp(—(t — 100 x 10~”))u(t — 100 x 107%). (3.546) 


(a) Determine and sketch the channel magnitude in both time and frequency domains, that is, 
|A,(t)| and |h,(f)|, where h,(/) is the continuous-time Fourier transform of /,(¢). Do not forget 


to label correctly. 

(b) Find the passband channel /,(¢) by applying an ideal bandpass filter p(¢) with the center 
frequency at f, and the absolute bandwidth W to the channel h(t). Sketch |h,(f)|, where h,(/) 
is the continuous-time Fourier transform of /,(¢). Do not forget to label correctly. The 
convolution can be done; you might consider using Mathematica if you have trouble. 


(c) Find the complex baseband equivalent channel A(t). Sketch |h(f)|, where h(/) is the 
continuous-time Fourier transform of h(t). Do not forget to label correctly. 


(d) Find an equation for the discrete-time complex baseband equivalent channel h[n], assuming 
sampling at the Nyquist rate. 


51. Consider a wireless communication system with the carrier frequency of f, = 1700MHz and the 
absolute bandwidth of W = 20MHz. The channel response is given by 


= Yeats 6(t — k x 10~®) (3.547) 


where 0<a< 1. 


(a) Determine and sketch the channel magnitude in both time and frequency domains, that is, 
|A,(t)| and [h,(f)|, where h,(f) is the continuous-time Fourier transform of /,(t). Do not forget 
to label correctly. 

(b) Find the passband channel /,(t) by applying an ideal bandpass filter p(t) with the center 
frequency at f, and the absolute bandwidth W to the channel h(¢). Sketch |h,(/)|, where h,(/) 
is the continuous-time Fourier transform of /,(¢). Do not forget to label correctly. The 
convolution can be done; you might consider using Mathematica if you have trouble. 


(c) Find the complex baseband equivalent channel A(t). Sketch |h(f)|, where h(/) is the 
continuous-time Fourier transform of h(t). Do not forget to label correctly. 


(d) Find the pseudo-baseband equivalent channel /,,(¢). 


(e) Find an equation for the discrete-time complex baseband equivalent channel h[n], assuming 
sampling at the Nyquist rate. 


52. Consider a wireless communication system with carrier frequency of f, = 2GHz and an 
absolute bandwidth of 1OMHz. Suppose that the sampled complex baseband signal to be sent is 
z{n| = exp(jmn) + exp(j3an/10). (3.548) 


(a) Suppose that x[7] is fed into a discrete-to-continuous converter operating at five times the 


Nyquist rate. What is the complex baseband equivalent signal x(t), which is the signal output 
from this converter? 


(b) Suppose that x(t) is modulated by the RF to produce the passband signal x,(7). Draw a 
block diagram that describes the modulation operations. 
(c) Write an equation for the passband signal x,(7). 


(d) What is the bandwidth of the resulting passband signal x,(7)? 
53. Passband-to-Baseband Conversion Consider the following passband signal: 
Yp(t) = V2 sinc(2t) cos(200zt). (3.549) 


(a) What is the carrier frequency of y,(t)? 
(b) What is the absolute passband bandwidth of y,(¢)? 


(c) Find the complex baseband equivalent y(t) of y,(¢). 
(d) What is the absolute bandwidth of y(t)? 
54. Consider the continuous-time signal 


x(t) =cos(10mt) + sin(3mt + 7/4). (3.550) 


Let T be the sampling period and reconstruction period. Let x,(¢) be x(t)W(E) when WE) = >. 

hy ee le : 
8(t — kT). We uniformly sample x(#) to formx[n] = x(n7). We filter "[7] = 4[n] — 346[n — 1] 
to produce y[n]. We produce y(t) using an ideal discrete-to-continuous converter. 


(a) What is the largest value of T that we can use to sample the signal and still achieve perfect 
reconstruction? 


(b) Suppose we sample with 7'= 1/30. Please illustrate |x,(/)| in the period —1/2T to 1/27. Do 
not forget to label correctly. 


(c) Again suppose that we sample with T= 1/30. Please determine y[7] and illustrate |y(f)| in 
the period —1/2T to 1/27. Do not forget to label correctly. 


(d) Suppose we sample with 7'= 1/10. Please illustrate |x,(/)| in the period —1/2T to 1/2T. Do 
not forget to label correctly. 


(ec) Again suppose that we sample with T= 1/10. Please determine y[] and illustrate [y(/)| in 
the period —1/27 to 1/2T. Do not forget to label correctly. 
55. Suppose that you have a continuous function f(x) that you want to approximate with a 
polynomial a + Bx + yx’. Find the values of a, B, and y such that the squared error 


1 
[ ¢@ a — Bx —yx*)*dx (3.551) 
JO 


is minimized. You can exchange integration and differentiation operations as needed. Assume 
that x is real. 


56. Suppose you model a desired signal y[”] as 
y(n] = ag +a, /n (3.552) 
and let e[n] =y[n] — ay + ayn. Suppose you are given y[n] forn=1,2,..., N. Find an 
expression for the coefficients ag and a, such that the poe =1 le[n]|? minimized. 


57. Suppose you model a desired signal x[n] using a p"*-order polynomial as 


y(n] = ag + nay + n?ag+...+ nay. (3.553) 
Suppose you are ene y[n] for n=0, 1, 2,...,.N. Find an expression for the coefficients 
{2% }—0 such that peli o |e [n]|* is minimized when e[n] = y[n] — (ag + na, +n?a,+...+ 
nay). 


58. Consider a system that provides an output y[”] for an input x[”] at time 7, where the input- 


output relationship is believed to have the following model: 
y[n] = Boz[n] + B,x°[n] (3.554) 


where Bp and f, are the complex coefficients to be estimated. A variation of this type of 

memoryless input-output relationship (for real signals) is a good model for a power amplifier. 
Let e[n] = y[n] — Box[n] — B,x°[n]. Suppose that we have N (N> 3) observations y[0], y[1], . 

., ¥LN — 1] corresponding to N known training symbols x[0], x[ 1], ...,x[M— 1]. Our goal is 

to use the least squares estimation method to find the coefficients By and B, based on the known 

training symbols and their corresponding observations such that Ue aa le[n}|? is minimized. 

(a) Assuming that B, = 0, find the least squares estimation of By by expanding the sum, 
differentiating, and solving. 

(b) Assuming that By = 0, find the least squares estimation of 8, by expanding the sum, 
differentiating, and solving. 

(c) Find the least squares estimation of By and B, by expanding the sum, differentiating, and 
solving. 

(d) Formulate and solve the least squares problem for finding B, and B, in matrix form. 


59. Consider a system that provides an output y[”] for an input x[”] at time ”, where the input- 
output relationship is believed to have the following model: 


y(n] = Por[n] + Bi 2[n]x*[n], (3.555) 


where fy and B, are the (possibly complex) coefficients to be estimated. Let e[n] = y[n] — 

Box[n] — B,x[n]x*[n]. Suppose that we have N (N = 3) observations y[0], y[1],...,y[N— 1] 

corresponding to N known training symbols x[0], x[1], ...,x[M— 1]. Our goal is to use the 

least squares estimation method to find the coefficients By and B, based on the known training 

symbols and their corresponding observations such that aa je[n| ? is minimized. 

(a) Assuming that B, = 0, find the least squares estimation of By by expanding the sum, 
differentiating, and solving. 

(b) Assuming that By = 0, find the least squares estimation of B, by expanding the sum, 
differentiating, and solving. 

(c) Find the least squares estimation of By and B, by expanding the sum, differentiating, and 
solving. 

(d) Formulate and solve the least squares problem for finding By and 2, in matrix form. Note 
that because of the matrix dimensions, you can find an expression for the inverse. 


60. Consider a continuous function f(x) = xe* for 0 <x < 1 that we would like to model witha 


simple polynomial g(x) = a+ Bx. Our goal is to find the values of a and B that minimize the 


ee 
squared error between f(x) and g(x). Hint: You might want to recall that I, f(e)de =1.,.4 
i. : 

J, wf(x)dz = e~* 


(a) Write an expression for the squared error function. Hint: You need an integral. 


(b) By taking derivatives with respect to a and 8, formulate a least squares problem in matrix 
form. 


(c) Solve for the optimal o and B. 
(d) Evaluate the mean squared error using the o and £ that you find. 


61. Suppose that w[n] is an IID random process with zero mean and unit variance. Let x[n] = w[7] 


+ 0.5w[n — 1]. Suppose that y[n] =x[n] + v[n] where v[7] is an IID random process with zero 


mean and variance o°. 


(a) First consider a scalar MMSE problem. Find the value of g such that 
E [|g*y[n] — x[n]|7] (3.556) 
is minimized. 
(b) Determine an expression for the resulting mean square error. 


TplK-1 
(c) Now suppose that we have an MMSE filter with taps {gf be=0 . Find the values of g such 
that 


c—1 


; 2 
E oy g |kly[n — k] — a[n] (3.557) 


k=0 


is minimized. 
(d) Determine an expression for the mean square error. 


Chapter 4. Digital Modulation and Demodulation 


The foundation of modern wireless communication systems 1s digital communication. The mapping of 
digital information onto waveforms at the transmitter is known as digital modulation. The extraction 
of the transmitted digital information from the noisy received signal is known as digital demodulation. 
General background on the principles of digital communication was provided in Chapter 2, to 
introduce important components of an entire digital communication system and relevant terminology. 
This chapter introduces a special class of digital modulation techniques that we call complex pulse- 
amplitude modulation and describes important mathematical concepts that underlie this type of 
modulation. There are two components of complex pulse-amplitude modulation that can be designed: 
the symbol constellation and the pulse-shaping filter. First we provide background on symbol 
mapping and constellations, including a review of several common constellations. We assume that the 
constellations are normalized to have zero mean and unit energy. After defining the bandwidth of the 
complex pulse-amplitude modulated signal, we introduce the AWGN channel, where the only 
impairment is additive Gaussian noise. The generalization to include frequency-selective channels is 
found in Chapter 5. Then we present results on designing the pulse shape for a communication channel 
where there is only AWGN. We describe how to implement pulse shaping using multirate digital 
signal processing. Finally, we examine the basic receiver operations. We derive the maximum 
likelihood symbol detector and then we characterize its performance through bounds on the 
probability of symbol error. 


4.1 Transmitter for Complex Pulse-Amplitude Modulation 


There are many different types of digital communication signals. In this book we consider the general 
class of modulation techniques that can be generated according to the abstract block diagram in 
Figure 4.1. We call this type of modulation complex pulse-amplitude modulation, since complex 
symbols amplitude modulate successive pulses. This diagram is meant to reflect the mathematical 
process of creating a baseband complex bandlimited waveform and does not exactly translate into a 
hardware implementation. A more realistic implementation of the filtering is provided in Figure 4.10. 
Pulse-shaping filter 
(to create bandlimited signal) 


Re[s[”]]]} Create 
pulse train 

Create 
pulse train 


Symbol 
mapping 


Im[s[7] ] 


J 
Figure 4.1 Abstract block diagram for generating a complex pulse-amplitude modulated waveform 


Now we explain each block in Figure 4.1, starting from the left and working toward the right. 


The source for digital modulation is a sequence of bits {b[n]} that the transmitter intends to send to 
the receiver. The bit sequence could have been generated from an error control code, as described in 
Chapter 2, but the presence of coding is not exploited in this or cue chapters. The physical 


layer often works with a finite sequence or block of bits {1 I} n= 0. In this section we consider the 
usual case where the receiver does not know the bits being sent by the transmitter, which is the entire 
reason for performing communication in the first place! In Chapter 5, though, we relax this assumption 
and exploit known information in the form of training signals to estimate and mitigate different 
channel impairments to further assist in the demodulation of an unknown bit sequence. 


The sequence of bits {b[]} is processed by the symbol-mapping block to produce a sequence of 
symbols {s[n]}. With complex pulse-amplitude modulation, each value of s[n] is a complex number 
that comes froma finite set of symbols called the constellation and 1s written as 


C = {cp, ...,Cm@—2,CM-1}- (4.1) 


The entries of the constellation are different (possibly complex) values. The size of the constellation, 
or cardinality, is denoted as |C| = M, where M is the number of symbols in the constellation. For 


practical implementation M = 2”, where b is the number of bits per symbol so that a group of b input 
bits can be mapped to one symbol. In more sophisticated digital communication systems, the symbols 
may be vectors or matrices, for example, as discussed in Chapter 6. 


The next two operations in Figure 4.1 generate a waveform from the symbol sequence {s[n]}. The 
first functional block performs the symbol-to-pulse-train operation to produce }),, Re[s[n]] 6(¢ — 17) 


and »,, Im[s[”]] 6(¢— nT). Note that the output consists of the real and imaginary parts of the symbols 
s[n] multiplying an ideal Dirac delta 6(t — nT). The spacing between symbols 1s 7, the symbol period. 

The pulse train sequence is passed to the pulse-shaping filter with impulse response g,,(t). The 
output is the complex baseband waveform 


Itx(t) * 2 s|njo(t — nT) = io s[n|otx(t — nT). (4.2) 


tT Th 


The right-hand side of (4.2) is reminiscent of the Nyquist reconstruction formula with g,,(7) acting as 
a reconstruction filter. Unlike perfect reconstruction, though, the pulse-shaping filter g,,(¢) 1s not in 


general a sinc function as in (3.82). The choice of transmit pulse shaping impacts the bandwidth of the 
transmit signal and other properties. In applications considered in this chapter, g,,(¢) is specially 


chosen to ensure that the transmitted waveform is bandlimited. Note that g,,(¢) 1s assumed to be real 
in this book, which is the most common scenario in commercial wireless systems. 


The output of the pulse-shaping filter is scaled by VEx to produce the complex baseband signal 


)= salt Be ba s(n|dtx(t — nT). (4.3) 
The scaling factor E, is used to model the effect of amplifying x(t) to add power. This chapter makes 
several normalization assumptions so that F/T represents the transmit power of the signal. This book 
primarily considers digital communication signals where the complex envelope of the signal is 


generated according to (4.3). 

We refer to x(t) as a complex pulse-amplitude signal because complex symbols modulate a 
succession of pulses given by the function g,,(t). Effectively, symbol s[7] rides the pulse g,,(t-7). 
The symbol rate R, corresponding to x(¢) 1n (4.3) is 1/7. The units are symbols per second. Typically 
in wireless systems this is measured in kilosymbols per second or megasymbols per second, but 
emerging commercial systems like millimeter-wave systems measure data rates in gigasymbols per 
second [268]. Be careful to note that 1/7 is not necessarily the bandwidth of x(t), which depends on 
2,,(t) as will become clear in Section 4.3. The bit rate is b/T (recall that there are 2? symbols in the 
constellation) and is measured in bits per second. The bit rate is a measure of the number of bits per 
second the baseband waveform x(f) carries. 


Example 4.1 


In this example, we provide a visualization of x(t) in (4.3). Suppose that pulse-amplitude modulation 
is used with VM = 4, whichis called 4-PAM. This is a real constellation that is not widely used in 
wireless communication but is convenient for illustration purposes. To simplify the plot, set £, = 1 


and 7 = 1. Suppose that the bit-to-symbol mapping is given in JTable 4.1. This is an example of Gray 
coding where neighboring symbols have only 1-bit difference. 


¢ Assuming the bit sequence is {b[n]} = {0010011100}, starting from 2 = 0 to n = 7, determine 
the sequence of symbols. 

Input | Output 
00 | —3/75 
01 | -1/v5 
10 | 3/v5 
11 1/75 

Table 4.1 A Bit-to-Symbol Mapping for 4-PAM 


Answer: The first pair of bits gives symbol ° [0] = -3/ V5, The rest of the pairs give 
s[1] = 3/V5, s[2] = -1/V5, s[3] = 1/V5, ana 8[4] = —1/V5. 
¢ Determine a simple expression for x(t), assuming that s[”] = 0 for n <0 orn > 4. 


Answer: From (4.3), remembering that E, = 1 and T= 1, 


x(t) = s[O]gix(t) + s[1]gtx(t — 1) + s[2]9tx(¢ — 2) 
+ s[3]otx(t — 3) + 3[4] gtx (t — 4). (4.4) 
* Plot x(t) for g,,(t) = rect(t) and g,, = sinc(?). 
Answer: The plots are shown in Figure 4.2 along with plots of the pulse shapes. 
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Figure 4.2 (a) A rectangular transmit pulse shape. (b) An encoded sequence of 4-PAM symbols 
according to Example 4.1 witha rectangular pulse shape. (c) A sinc transmit pulse shape. (d) An 
encoded sequence of 4-PAM symbols according to Example 4.1 with a sinc pulse shape. The 
symbols in both cases ride the pulse shape. It is harder to visualize in the case of the sinc function 
because of the ripples of the sinc. In practice, functions like the sinc are preferred because they are 
bandlimited. 


In Example 4.1, we provided a visualization of x(t) for 4-PAM modulation for two different 
choices of pulse-shaping functions. In subsequent sections we discuss issues related to complex 
pulse-amplitude modulation with more care, including the choice of constellation, pulse shape, and 
appropriate receiver design. 


4.2 Symbol Mapping and Constellations 


This section provides some background on common constellations and establishes some procedures 
for working with constellations. In particular, the mean of a constellation is determined and removed 
if present. Then the energy of the zero constellation is computed and the constellation is scaled so it 
has unit energy. Throughout the remainder of this book, unless otherwise specified, the constellations 
have zero mean and unit energy. 


4.2.1 Common Constellations 


In this section we review common constellations used with the complex pulse-amplitude modulation 
framework. A constellation is defined by two quantities, the alphabet or set of symbols C and the bit- 
to-symbol mapping. Often a constellation plot is used to illustrate both the complex values in C and 
the corresponding binary mapping. To help in implementation, it is often convenient to order the 
constellation set according to its binary bit-to-symbol mapping. For example, if M = 4, then the 
symbol for the binary number 01 would be the second entry of C. 


Several different constellations are used in commercial wireless systems and are summarized here. 
The normalization of the constellations and their order are designed for pedagogical purposes and 
may vary in different textbooks or standards. Later in this section, we explain the preferred 
constellation normalization used in this book. 


¢ Binary phase-shift keying (BPSK) has 
C = {+1, -1}. (4.5) 
This is arguably the simplest constellation in use. The bit labeling is typically s[n] = (-1)?!"!; 
thus b[n] = 0 — s[n] = 1 and b[n] = 1 — s[n] =-1. 


¢ M-pulse-amplitude modulation (M-PAM) is a generalization of BPSK to any M that is a power 
of 2. The M-PAM constellation is typically written as 


‘A &, 
e={ = Dom fe ree = 2h. (4.6) 


2 


Like BPSK, M-PAM is a real constellation. M/-PAM is not common in wireless systems since 
for M > 2 the most energy-efficient constellations have complex values. 


* Quadrature phase-shift keying (QPSK) is a complex generalization of BPSK with 
C={i+}—-1+j,-1-j,1—j} (4.7) 


Essentially QPSK uses BPSK for the real and BPSK for the imaginary component. QPSK is 
also known as 4-QAM and is used in several commercial wireless systems like IEEE 802.11¢ 
and IEEE 802.16. 


¢ M-phase-shift keying (PSK) is a constellation constructed by taking equally spaced points on 
the complex unit circle 


joxk ) M-1 
é= jer } | (4.8) 


While not strictly required, in practice / is chosen to be a power of 2. The constellation 


symbols are the M" roots of unity. 4-PSK is a different and alternative generalization of BPSK 
with 


i |e (4.9) 


The most common PSK constellation is 8-PSK, which fills the gap between 4-QAM and 16- 
QAM and provides a constellation with 3 bits per symbol. 8-PSK is used in the EDGE 
standard, an extension of the GSM mobile cellular standard [120]. 


¢ M-QAM 1s a generalization of QPSK to arbitrary M, which is a power of 4. M-QAM is formed 
as a Cartesian product of two M//2-PAM constellations, with symbols of the form 


_ kof as 
CE fane goles (4.10) 


where k, £ © {-(M/2 — 1), -—(M/2 —-2),...,—-1,1,...,(M/2 —1)}. Incommercial wireless 
systems, 4-QAM, 16-QAM, and 64-QAM are common, for example, IEEE 802.11a/g/n, IEEE 
802.16, and 3GPP LTE. IEEE 802.1lac adds support for 256-QAM. Microwave links used for 
backhaul can support much higher values of M [106]. 


In a wireless system, the performance in terms of coded bit error rate 1s determined in part by the 
exact mapping of bits to symbols. There are no unique bit-to-symbol mappings in practice, though the 
bit labelings in Figure 4.3 are typical. The specific labeling in this case is known as Gray coding. 
Essentially the labeling is done such that the closest symbols differ by only a single bit. It turns out 
that the closest symbols are the most likely errors; thus this results in a single-bit error for the most 
typical bit errors. Set-partitioning (SP) labeling is another paradigm for constellation labeling used in 
trellis-coded modulation. The constellation is partitioned into mutually exclusive subsets of symbols 
called cosets, and the labels are assigned to maximize the separation between the nearest neighbors in 
the same coset [334, 335]. Other labelings are associated with more advanced types of error control 
codes like bit-interleaved coded modulations. These labelings may be determined through 
optimization of coded bit error rate performance [60]. 


BPSK 


Im 
(0000) (0100) (1100) (1000) (011) 
° e+e m4 (010) (001) 
V10 
(0001) (0101) (1101) (1001) 
_ e e ,2@ — (110) (000) 
10 V10 


1 Re 
& & @ fe 
(0%!) (0111) | (4111) (1011) Ps 
~3j 
=i (111) 
e & 3 s 


(0010) (0110) (1110) (1010) 


16-QAM 8-PSK 


(000000) (001000) (011000) (010000) 
e a. 


(110000) (111000) (101000) (100000) 
& e 2 ® 


(110001) (111001) (101001) (100001) 
& € & & 


(110011) (111011) (101011) (100011) 
ob @ a ® 

(110010) (111010) (101010) (100010) 
@ e @ ® 


(000001) (001001) (011001) (010001) 
& = S ® 


(000011) (001011) (011011) (010011) 
@ B & @ 

(000010) (001010) (011010) (010010) 
& & e ® 


& S @ \vc@ Re 
(110110) (111110) (101110)(100110) 
® ® & @ 

(110111) (1117111) (101111)(100111) 


@ é e 
(110101) (111101) (101101)(100101) 


'7 

v6 @ & 

(000110) (001110) (011110) (010110) 
& B & » 

(000111) (001111) (011111) (010111) 
® & & & 

(000101) (001101) (011101) (010101) 


® & & 
(110100) (111100) (101100)(100100) 


aa a fe 
e e© ee e: 
(000100)(001100) (011100) (010100) 


64-QAM 
Figure 4.3 Several different constellations with generally accepted bit-to-symbol mappings based 
on Gray labeling to minimize the number of bit differences in adjacent symbols 


4.2.2 Symbol Mean 
In this section, we compute the mean of a constellation and show how to transform it so that it has 
zero mean. 


Any given finite set of complex numbers C can be used as a constellation. For analysis and 
implementation in wireless systems, it is desirable for the symbol mean 


Ls = E, [s[n]| (4.11) 


to be zero, for example, ., = 0. To compute the mean (and the constellation energy), it is usually 
assumed that the symbols in the constellation are equally likely; thus P,[s[n] = c] = 1/M for any c © 
C, and further that s[7] 1s an IID random process. Equally likely symbols are reasonable if there is 
any form of encryption or scrambling prior to transmission, as is usually the case. Under this 
assumption, the expectation in (4.11) may be computed as 

M-—1 


1 
hs = M » Cm: (4.12) 


4 
m0 


Therefore, if the mean of the constellation is defined as the right-hand side of (4.12), then the symbols 
will also have zero mean. Having zero-mean constellations also ensures that the complex baseband 
signal x(t) is also zero mean. A nonzero mean corresponds to a DC component, which makes analog 
baseband implementation more challenging. 


Given an arbitrary C, it is desirable to compute an equivalent constellation that has zero mean. It is 
possible to construct a zero-mean constellation Cp from C by simply removing the mean from all the 


constellation points. To do this, first compute the mean of the constellation: 


, “= 
—— oles 4.13 
* M X ? 

Then construct a new zero-mean constellation by subtracting the mean: 
Co = {Gg play Ca Bigg wews CM-—1 — bs}. (4.14) 


For the remainder of this book (except in the problems or where otherwise stated) we assume that the 
constellation has zero mean. 


4.2.3 Symbol Energy 


Now we define the symbol energy and show how to normalize the constellation so that the symbol 
energy is unity. 
The symbol energy is defined for zero-mean constellations as 


E, =E, ols , (4.15) 


To calculate E, we also use the equally likely and IID assumption on s[7]; thus 


E, =E, olM (4.16) 
M-1 


l , _— 
=e > Gals (4.17) 


m=0 


In this book, we adjust the constellation so that E, = 1. The reason is that the symbol energy £, is 


related to the amount of transmit power contained in the signal x(t), as is explained in Section 4.3. If 
E, is large, then the transmitted signal will have a large amplitude (equivalently the transmit power 


will also be large). In our mathematical framework, we reserve the term £, to control the gain in the 
transmitted signal; thus we adjust the constellation so that £, = 1. Different textbooks make different 


assumptions about the normalization of the constellations. Most practical implementations use 
constellations that are normalized, as proposed in this book, because gain in the transmitted signal is 
applied in the analog front end of the radio and not by the baseband signal procesing. 


To find an equivalent constellation C with unit energy given an arbitrary zero-mean constellation, it 
suffices to scale all the constellation points by some o such that the resulting constellation has £, = 1. 


All the constellation points are scaled by the same value to preserve the distance properties of the 
constellation. 


Let the scaling factor a be selected to force the new constellation to have £, = 1. Given an arbitrary 
zero-mean constellation C= {co, cj, ..., Cjy_;}, the scaled constellation is 


Cg = Og hE yy ong HE Qo fs (4.18) 


Now substituting (4.18) into (4.17) and setting F, = 1: 


ee nr (4.19) 


The scaled constellation C,, with the optimum value of a in (4.19) is the normalized constellation. We 
conclude this section with some example calculations of normalized constellations in Example 4.2 
and Example 4.3. 


Example 4.2 


Figure 4.4 contains two different 8-point signal constellations. The minimum distance between 
adjacent points is B. Assuming that the signal points are equally likely in each constellation, find £, 


for each constellation and suggest a suitable choice of a to make the constellation have unit energy. 
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Figure 4.4 Two signal constellations 

Answer: For the constellation on the left, the average transmitted power is 
i eee 1 , 

E; = = a + B? + B* + B*) + 4 (2B* + 2B? 
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From (4.19), a suitable normalization is 


For the constellation on the right, the average transmitted power is 


oo: Bt, B ial 3B* , 3B" wll 7BY 7B, TB’ TB 
P 8\ 4 4 8\ 4 1 8\ 4 | | { 


1 /{ B? 1 { 3B? 1 - 
ee alee i. = 708" 
(5) +a(2) 52 


B? 3B? 7B? 
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From (4.19), a suitable normalization is 


(4.24) 


_ ' 4,29 
a 9B ( ) 
Example 4.3 
Consider a 4-point constellation® = {j.(1 —j)/ V2, -j, (lis v2} 
* Compute the mean of the constellation. 
Answer: The mean of the constellation is 
alge fi ge Se Bly _g (4.30) 
ls =—{j- —(-1- —(1- 4.30 
I 1 Po] 2 J /2 J 
1 
— —j——. 4.31 
5 (4.31) 


* Compute the zero-mean constellation. 


Answer: The zero-mean constellation is 


=} (14 =m) (1 — j)/V2, ( L+ = alis(-I iyiv3} (4.32) 


* Compute the normalized zero-mean constellation. 


Answer: Computing (4.19), 


a= |_———_,—_—________,—_- (4.33) 


= (4.34) 


then the normalized zero-mean constellation is 


dt 1 t 1 1 1 
a =4 3) (Ed tL fee eel Se Fo 
ak Oc ae ed a 


For the rest of this book, unless otherwise specified, we assume that the constellation has been 
normalized so that the symbol energy is 1. 


4.3 Computing the Bandwidth and Power of x(4) 


The signal x(t) 1s the complex baseband signal that is passed to the analog front end for upconversion 
and transmission on the wireless channel. As such it is useful to define several quantities of interest 
related to x(t). Along the way, we also establish our main normalization assumption about g;,(7). 


To compute the bandwidth of a signal x(t), the conventional approach would be to compute the 
Fourier spectrum and make a bandwidth determination from that spectrum. As s[n] is usually modeled 
as an IID random process, this direction calculation is not meaningful. As discussed in Chapter 3, the 
bandwidth of a random process is usually defined based on its power spectral density, for WSS 
random processes. While x(f) is in fact a random process, it is actually not WSS. In fact it is what is 
known as a cyclostationary wide-sense stationary random process [121]. The subtleties of this 
process are beyond the scope of this book; for our purposes it suffices that we can define an 
operational notion of power spectral density for such processes as 

Ey 9 


Pra(f) = Gr Es |Gtx(F)I". (4.36) 


Under the unit symbol energy assumption, F, = | thus 


o ce, weg stat as 
Pra (f) = P Gixl(f)| (4.37) 
and it is apparent that |G,,(/)|? determines the bandwidth of x(t). 
To calculate the power associated with x(f), it is useful to make some additional assumptions about 
£,,(t). In this book, unless otherwise stated, we always assume that the pulse shape is normalized to 
have unit energy. This means that 


POO ‘5 
| Galtier =1 (4.38) 
or equivalently by Parseval’s theorem 
| gtx (t)|"dt = 1. (4.39) 


We assume this for the same reason that we assume the constellation has unit energy. Specifically, in 
our mathematical formulation we want to use E, exclusively to control the transmit power. Under this 


assumption the transmit power is 


[ Pelfdg ==. (4.40) 


where £, plays the role of the signal energy and energy per second is the appropriate unit of power. 


Example 4.4 
Determine the normalized version of the pulse-shaping filter 


Stx,non(t) = sinc?(t/T), (4.41) 


the power spectral density of the corresponding x(t), and the absolute bandwidth. 
Answer: Compute the normalized pulse shape by finding its energy: 


/ |9tx,non(t)|*dt = / eiccronl st) | aie (4.42) 
= ma : (1=|firydf (4.43) 

‘T . 4.44 

=a (4.44) 


Therefore, the normalized pulse shape is 


¢ x — = 5 ; = . te 
Gtx (©, \ 5 sinc T (4 45 


The power spectral density of x(t) with the normalized pulse shape comes from (4.37). Assuming 
the constellation has been properly normalized, 


= Ey (3, — ~ 
Pro(f) = a \ ap lL AVI) (4.46) 
= = B,A*(fT). (4.47) 


The absolute bandwidth of P,,(/) is just the absolute bandwidth of |G,,(f)|?. Since 
Gx(f) = TACT) 
absolute bandwidth is 1/T. 


, from Table 3.2 and using the time-scaling theorem from Table 3.1, the 


4.4 Communication in the AWGN Channel 


In this section, we explore communication in the presence of the most basic impairment: AWGN. We 
start with an introduction to AWGN. Then we propose a pulse shape design that achieves good 
performance in AWGN channels. Important concepts like Nyquist pulse shapes and the matched filter 
are a by-product of the discussion. Then, assuming an optimum pulse shape, we derive the maximum 
likelihood symbol detector for AWGN. We conclude the section with an analysis of a bound on the 
probability of symbol error, which reveals the importance of the SNR. 


4.4.1 Introduction to the AWGN Channel 

Beyond bandwidth and power, other characteristics of the pulse-shaping filter g,,(t) play a role in 
determining how well the receiver can detect the transmitted symbols. The design of the pulse- 
shaping filter depends on the impairments that are created by the propagation channel and also the 
electronics at the receiver. The most basic impairment—which 1s a steppingstone for dealing with 
more complicated impairments—is AWGN. The transmit pulse-shaping filter 1s commonly chosen 
assuming AWGN is the main impairment, even in more complicated systems. In this section, we 
introduce the AWGN channel model and the associated transmitter and receiver. 

Thermal noise is present in every communication receiver. It is created from the variation of 
electrons due to temperature and occurs in the different analog receiver components. Because the 
received voltages can be very small in a wireless system (because of distance-dependent path loss as 
discussed in Section 2.3.3), thermal noise has a measurable impact on the received signal. There are 
other kinds of noise in wireless systems, but thermal noise is the most common impairment. The 
design of the RF front end has a great impact on the severity of thermal noise through the effective 
noise temperature influenced by the choice of components [271]. Further discussion, though, 1s 
beyond the scope of this book. 

The AWGN communication channel is a mathematical model for the impairment due to thermal 
noise. Because we deal with the complex baseband equivalent system, we consider AWGN as 
applied at baseband. A simple model is illustrated in Figure 4.5. Mathematically 


y(t) = x(t) + v(t), (4.48) 


v(t) 
(AWGN) 
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Figure 4.5 The additive white Gaussian noise communication channel 


where x(f) is the transmitted complex baseband signal, v(t) is the AWGN, and y(f) is the observed 
complex baseband signal. 
The assumption that v(t) is AWGN has the following implications: 


¢ The noise is additive. Other types of noise, like multiplicative noise and phase noise, are also 
possible but lead to other types of additive impairments. 


¢ The noise is IID (this is where the term white comes from, since the power spectral density of 
an IID signal 1s flat in the frequency domain) and is therefore also a WSS random process. The 
autocorrelation function is R,,(t) = o7 8(t), and the power spectral density is Pio f) = Oo”. 


¢ The first-order distribution of v(t) is N.(0, 67). 


¢ The total variance is 6” = N,. The noise spectral density is N, = kT., where k is Boltzmann’s 
constant k = 1.38 x 10-73 J/K and the effective noise temperature of the device is 7, in kelvins. 


Assume T, = 290K in the absence of other information. The effective noise temperature is a 


function of the ambient temperature, the type of antennas, as well as the material properties of 
the analog front end. Sometimes N, is expressed in decibels. For example, if 7, = 290K, then 


N, = —2284B/Hz. 


The signal-to-noise ratio (SNR) is the critical measure of performance for the AWGN 
communication channel. We define it as 


ce 
SNR =—, (4.49) 


200 


Many different performance measures in AWGN channels are a function of the SNR, including the 
probability of symbol error and the channel capacity [28]. In other types of noise or receiver 
impairments, different measures of performance may be more suitable. The SNR, though, is a widely 
used parameter in wireless systems. 


4.4.2 Receiver for Complex Pulse-Amplitude Modulation in AWGN 


The receiver in a digital communication system is tasked with taking an observation of y(¢) and 
making its best guess about the corresponding {s[n]} that were transmitted. The theory behind this 
“best guess” comes from detection theory [340, 339, 341] and depends on the assumed channel model 
that describes the transformation of x(t) to y(t). Detection theory 1s related to hypothesis testing in 
statistics [176]; in this case the receiver tests different hypotheses about the possible values of the 
symbols that were sent. 


In this book, we focus on the receiver structure for complex pulse-amplitude modulated signals x(7) 
as described in (4.3). The derivation (from first principles) of this optimum receiver structure for the 
AWGN channel for arbitrary modulations 1s beyond the scope of this book. To simplify the 
presentation, yet capture the salient details of the receiver function in an AWGN system, we start with 
the observation that the optimal receiver has the form in Figure 4.6. 
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Figure 4.6 Transmitter and receiver for an AWGN communication channel 


Now we summarize the key receiver blocks in Figure 4.6. The filter g.,(t) is known as the receiver 


pulse-shaping filter. It performs a bandlimiting operation among other functions, as will become 
clearer in Section 4.4.3. The continuous-to-discrete converter samples the received signal at the 
symbol rate of 1/7. The detection block produces a good guess about the s[] that might have been 
transmitted based on the sampled output of the C/D. The inverse symbol mapping determines the bit 


sequence corresponding to the detected symbol sin] output from the detection block. The blocks in the 
receiver in Figure 4.6 work together to reverse the corresponding operations at the transmitter. 


There are three important design decisions in specifying the operation of the receiver: 
1. Determine the best receiver filter g..(¢) and how it depends on the transmitted signal. It turns 
out that a matched filter is optimum, as described further in Section 4.4.3. 


2. Specify what happens in the detection block using mathematical principles. We show in Section 
4.4.4 that the detector should implement a minimum Euclidean distance detector, where the 
(scaled) constellation symbol that is closest to the observed sample is output from the detector. 


3. Determine a more realizable structure for the pulse-shaping and filtering operations in Figure 
4.6. In Section 4.4.5, we use multirate signal processing concepts from Chapter 3 to implement 
both the transmit pulse shaping and the receive filtering in discrete time. 


The first two design decisions are addressed in this section, and the solution to the third is found in 
Section 4.5. 


4.4.3 Pulse Shape Design for the AWGN Channel 


In this section, we devise designs for the transmit pulse shape g;,(t) and the receive filter g,,(¢). The 


derivation is based on the idea of maximizing the signal-to-interference-plus-noise ratio (SINR), 
assuming a memoryless detector. Other types of signal processing, like equalization followed by 
detection, are explored in Chapter 5. 


The derivation starts with the received signal after AWGN: 


nis = V Ex Grx(t) * Otx(t) * _ s|m]6(t — mT’) + grx(t) * v(t). (4.50) 


m 


Let g(t) = g,,(2) * 2,,(¢) denote the combined transmit and receive pulse-shaping filters. After 
sampling at the symbol rate, 


y(n] = VEx > s|m|g((n — m)T) + o[n) (4.51) 


ee 


where v[n| = J v(7)grx(nT — 7)dr 


we rewrite (4.51) as 


y(n] = /Eys[n] + W/E be s|{m\g((n — m)T) + v[n]. (4.52) 


m+=n 


is the filtered noise. The detector extracts s[n] from y[7]; thus 


The first term is the desired signal, the second term is intersymbol interference, and the third term is 
sampled noise. As such, it makes sense to select g,,(¢) and g,,(¢) to permit the best detection 


performance. 


We propose to design the transmit pulse shape and received filter to maximize the SINR. The signal 
energy 1s 


E | VExsinlg(0), | = E,|9(0)|?. (4.53) 


The only way to increase the signal energy is by increasing F, (the transmit power) or increasing 
g(0). It will be clear shortly why increasing g(0) is not an option. 
The ISI energy is 


E VJ Ex > s|mlg((n — m)T) =e >. Ig(mT)|?. (4.54) 


m,men m0 


The ISI can be decreased by reducing F,, but this also reduces the signal power. The only other way 


to reduce ISI is to make |g(mT)| as small as possible for m # 0. There are special choices of g(f) that 
can completely eliminate ISI. 


The noise energy is 
E [lgrx(t) * oar P] = No f IGrx( A) Pag. (4.55) 


The larger the bandwidth of g.,(2), the higher the noise density. Reducing the bandwidth also reduces 


noise, as long as the bandwidth is at least as large as the signal bandwidth. Otherwise the transmitted 
signal may be filtered out. 


Treating the ISI as an additional source of Gaussian noise, the SINR becomes a relevant 
performance metric for AWGN channels. Computing the ratio of (4.53) with the sum of (4.54) and 
(4.55), 


E,.\9(0)|? 
No J \Grx( f) Pdf + Ex Lmgn |g(mT) 


SINR = 


P 


Now we proceed to find a good choice of g,,(¢) and g,,(t) to maximize the SINR. 


Our approach to maximizing the SINR is to find an upper bound that can be achieved with equality 
for the right pulse shape design. First, examine the average received signal power term E,|2(0)|7. 
From the definition of g(t), notice that 


g(0) = / Dex (—t) Gtx (t)dt. (4.57) 


To upper bound g(t), we apply the Cauchy-Schwarz inequality. For two complex integrable functions 
a(t) and b(t) with finite energy, the Cauchy-Schwarz inequality says that 


(/ a" (u(t), < [- la(t)|?dt , |b(t) |? dt (4.58) 


with equality if and only if b(t) = aa(t) where o is a (possibly complex) constant. Applying Cauchy- 
Schwarz to (4.57), 
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\9(0)|" = | 9e—Baa(t)at (4.59) 
< J lgcx(t)P at f get)? (4.60) 
= J lgex(t)P at (4.61) 


where the last step follows because we have assumed that g,,(t) has unit energy. Since g.,(t) shows 
up in every term of the numerator and denominator of the SINR, we can without loss of generality 
also assume that g,,(f) has unit energy. Consequently, take 9rx (t) = gtx(-t). sucha choice of 2yx(0) 


is known as a matched filter and leads to g(0) = 1. Effectively, the matched filter correlates the 
received signal with the transmit pulse shape to achieve the largest signal energy. 


Now we further upper bound the SINR by minimizing the denominator of (4.56). Since g.,(f) is 
determined from g;,(t), the noise power at this point is constant and cannot be further minimized. 
Instead we focus on the ISI term. Because the terms in the summation are nonnegative, the summation 
is minimized if all the terms are zero, that is, if Ey > no | g(mT)|* = 0. This is possible only if g(m7) = 


0 for m # 0. Combined with the fact that g(0) = 1 from our assumed normalization of the receive filter 
Z,,(t), the [SI is zero if it 1s possible to find g,,() such that 


g(nT) = 6[n]. (4.62) 
Notice that (4.62) places requirements on the samples of g(t) but does not otherwise constrain the 
waveform. Assuming such pulse shapes are realized, optimum SINR becomes simply SNR = E,/N,. 
To discover the implications of (4.62), let us go into the frequency domain. Treating g,[”] = g(nT) 


as a discrete-time sequence and taking the Fourier transform of both sides gives 


But recall from (3.83) that the CTFT of a continuous-time signal G(f) and the DTFT of the sampled 
signal Ger ) are related through 


- 
Ga(e?"F) = S° G(fT +h). (4.64) 
k=—co 
Combining (4.63) and (4.64) gives 
oc 
S> G(ff +k) =1. (4.65) 
k=—o0 
Effectively, the aliased sampled pulse shape should be a constant. 


Functions that satisfy (4.62) or (4.65) are known as Nyquist pulse shapes. They are special. 
Sampling at symbol rate 7, the sampled function g(n7) is zero for n # 0. Note that this is true only if 


the function is sampled at the exact correct place and does not imply that the function is otherwise 
zero. 


Example 4.5 
Consider a rectangular pulse shape 


—— 


Jtx(t) = \/ zrect(t/ T — 1/2). (4.66) 
* Find the matched filter g.,(7). 
Answer: 
Grx(t) = 9tx(—t) (4.67) 
12 , | 
=1/ arect(—t/T — 1/2). (4.68) 


¢ Find the combined filter g(t) = J 2x( S(t — T)dt. 


Answer: 


g(t) = / 9tx(T) Grx(t — T)dr (4.69) 


— ie (=) (4.70) 


where A is the triangle pulse in Table 3.2. 

° Is g(t) a Nyquist pulse shape? 
Answer: Yes, since g(n7) = A(n) = 6[n]. It is not a particularly good pulse shape choice, 
though, because g,,(t) is not bandlimited. 


Perhaps the best-known example of a Nyquist pulse shape is the sinc: 
Jsinc(t) = sinc(t/T). (4.71) 


With the sinc function, the baseband bandwidth of the pulse shape is 1/27, and there is no overlap in 


‘(fk 
the terms in © (f + T) in (4.65). Other choices of g(t) have larger bandwidth and the aliases add up 
so that equality is maintained in (4.65). 


The sinc pulse-shaping filter has a number of implementation challenges. Ideal implementations of 
g(t) do not exist in analog in practice. Digital implementations require truncating the pulse shape, 
which is a problem since it decays in time with 1/t, requiring a lot of memory. Furthermore, the sinc 
function is sensitive to sampling errors (not sampling at exactly the right point). For these reasons it 1s 
of interest to consider pulse shapes that have excess bandwidth, which means the baseband 
bandwidth of g(t) is greater than 1/27, or equivalently the passband bandwidth is greater than 1/7. 


The most common Nyquist pulse, besides the sinc pulse, is the raised cosine. The raised cosine 


pulse shape has Fourier spectrum 


zi 0<|f|<(Q-a)/2T 
Me (f)= 4 F [1+cos (= (f|-33#))], tae <Ifl < (4.72) 
0, fl> 
and transform 
cos(7at) 


dee(t) = sine(t/T) Foe pa (4.73) 


The parameter a is the rolloff factor, 0 < a < 1. Sometimes f is used instead of a. Often the rolloffis 
expressed as a percentage of excess bandwidth. In other words, 50% excess bandwidth would 
correspond to a = 0.5. 


Example 4.6 
Show that for any value of a, the raised cosine spectrum G,,(f) satisfies 


[- GA Hafea: (4.74) 


Answer: Let g,,(¢) be the inverse Fourier transform of G,,(f). At the sampling instants t = n7, we 
have 


"OO 
Ire(nT) = | Gulfjei™ af. (4.75) 
—oo 
We divide the integral in (4.75) into several integrals covering length 1/T intervals, which yields 
oo »(2m+1)/(2T) : 
Gre(nT) = | Geel fiend f (4.76) 
m=—co ¥ (2m—1)/(2T) 
9° 1/(2T) . 
= i Gre( f + m/T)e2*4"" df (4.77) 
m=—oo” — 1/(2T) 
1/(2T) 0° . 
= | >. Gre(f +m/T) pel?*F"T df (4.78) 
J—1/(2T) m=— oo 
f(T) 
= / Tel? inl af (4.79) 
=W(27) 


where (4.79) follows from the fact that the raised cosine function satisfies the Nyquist criterion. Thus, 
by setting n = 0, we have that 


9re(0) = | Orel fd f (4.80) 
1/(2T) 

. | Tdf (4.81) 
J—1/(2T) 


=1. (4.82) 


Example 4.7 


Using the results from Example 4.6, consider now the behavior of the raised cosine pulse shape when 
either oversampled or undersampled. 


* Is it still a Nyquist pulse shape if sampled at 1/(27)? 
Answer: For all fnote that 


Ge (F+ 57) = Ge (t+ 7) + mals a) (4.83) 
k ~ h 
=" Gre (s i. 7) + ¥ Gre (1 ft “ A. =) (4.84) 
h h m= 


A B 
=T+T (4.85) 


where the first term is T from Example 4.6 and the second term 1s 7 since shifting a flat 
spectrum maintains a flat spectrum. 


¢ What about sampling at 2/7? 
L+c 
Answer: The bandwidth of G,,(/) is OT which is at ea 1/T. Sampling with 2/7 means that 


there is no overlap in the functions in dik Gre (Ff + F T). Because there is no overlap, the 
spectrum cannot be flat since the spectrum of G,,(f) is not flat. 


The absolute bandwidth of the raised cosine pulse shape at baseband is (1 + a)/2T and at passband 
is (1 + a)/T. The Nyquist rate of the baseband digital communication signal x(t) with a raised cosine 
pulse shape is (1 + a)/T. Note that 1/7 1s the rate at which x(t) is sampled. Thus there is aliasing in 
x(t) since Nyquist is not satisfied. The zero-ISI condition, though, ensures that aliasing does not lead 
to intersymbol interference in the received signal. 

The raised cosine is plotted in Figure 4.7. For larger values of a the spectrum rolloffis smoother 


and the time-domain decay of the pulse is faster. This makes implementation easier—at the expense of 
excess bandwidth. 
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Figure 4.7 (a) The raised cosine pulse shape for various choices of o in the time domain. (b) The 
raised cosine pulse shape for various choices of o in the frequency domain. A value of T= 1 is 

assumed. 


In practice we do not usually use the raised cosine pulse directly. The reason 1s that the raised 
cosine pulse shape is g(t) whereas we use g,,(¢) at the transmitter and g,,(¢) at the receiver. Recall 


that g(t) = } 2 x(T) Z(t — T)dt and that g..() = g,,(—t). In the frequency domain this means that 
G(f) = Gex( GET), Consequently, we choose g,,(¢) to be a “square root” of the raised cosine. 
Such a pulse shape is known as a square root raised cosine or a root raised cosine and is written 


sin(({1—a)rt/T) 


dq cos((1+a)at/T) + 4at/T 


sc ro(E) = ; 9 
SeareXt) r/T 1 — (4at/T)° 


Note that this pulse shape is normalized. Note also that g,,..(¢) 1s even; thus if g,.(t) = Zsqrc(t), then 
2x(0) = Z(t) = 2,,(0) and the transmit pulse shape and the receive pulse shape are identical. The 


(4.87) 


Square root raised cosine is its own matched filter! 

Root raised cosines are a common transmit pulse for complex pulse-amplitude modulation. We 
now summarize key relationships between the bandwidth, the symbol rate, and the excess bandwidth 
with a square root raised transmit pulse shape. Equivalently when g(t) is a raised cosine: 

¢ The symbol rate is R = 1/T where T is the symbol period. 

¢ The absolute bandwidth of the complex pulse-amplitude modulated signal with root raised 
cosine pulse shaping at baseband is (1 + a)/27. 

¢ The absolute bandwidth of a complex pulse-amplitude modulated signal with root raised cosine 
pulse shaping modulated at some carrier frequency is (1 + a)/T. This is twice the bandwidth at 
baseband per the difference in bandwidth definitions between baseband and passband signals. 

¢ The Nyquist frequency for a root raised cosine pulse-shaped complex pulse-amplitude 
modulated signal at baseband is (1 + @)/2T. 

¢ The Nyquist rate for a root raised cosine pulse-shaped complex pulse-amplitude modulated 
signal at baseband is (1 + a)/T. 

A side benefit of the square root raised cosine filter is that the sampled noise becomes 


uncorrelated. Specifically, consider the discrete-time sampled noise in (4.51). The noise 1s still zero 
mean since Ev|0 [7] = 9. The autocovariance of the noise is 


Cov[k] = Ey [v[n]o*[n + k]] (4.88) 
=E, | [ [ (r1)0" (T2)deqre(NT — Tr) Gaqre((n—k)T —72)dryd72| (4.89) 
=o | Paare( M1 5 — T)Qsare( (se — &)T — 7) dr (4.90) 


= 3 | Gearc(T )Gsarc( —kT - T)dr (4.91) 


where we have used the IID property Ey [v(71)v(t2)] = 4(72 — 71) to simplify (4.89) to (4.90). 
Now recognizing that 2.qrc(¢) = sqre(—t), 1t follows that 


Cue lke? / Yuare(T)Jeare(KT + 7)dr-. (4.92) 
Notice, though, from the construction of the square root filter that 
Coie / Gearc(t — T)Gaare(T aT (4.93) 
= | Goqre(t + T)Gsqre(T dt (4.94) 
because of even symmetry. Since the raised cosine is a Nyquist pulse shape, it follows that 
Cov[k] = 05 9rc(KT) (4.95) 
= 02 6(k] (4.96) 


where 7 = No for complex AWGN. Therefore, the square root raised cosine preserves the IID 
in of the noise (the whiteness). This property is true for any appropriately chosen real square 
root of a Nyquist pulse. Detection with noise that is not white, which is correlated, is more 
complicated and 1s not discussed in this book. In practice, correlated noise is dealt with at the 
receiver by using a whitening filter. 


4.4.4 Symbol Detection in the AWGN Channel 


In this section, we address the problem of inferring the transmitted symbols given noisy observations 
at the receiver. This process is known as symbol detection. 

With a suitable combination of Nyquist pulse shaping and matched filtering (using the square root 
raised cosine, for example), the received signal after sampling can be written as 


= VEx ¥— s|mg((n — m)T) + grx(t) * v(t)| nr (4.97) 


mr 
= /E,s| s[n] + v[n] (4.98) 


where v[7] is IID complex Gaussian noise with NV.(0, N,). 
The detection problem is to answer the following question: 
Based on y[n], what is my best guess as to the value of s[]? 


This is an example of symbol detection. Other kinds of detection, such as bit detection or sequence 
detection, are used when detection is combined with joint forward error correction decoding. 


A symbol detector 1s an algorithm that given the observation y[”] produces the best s[n} EC 
according to some criterion. In this book, we consider the maximum likelihood (ML) detector 


s|n] = arg max x fy\s( (y[n||s[n] = s) (4.99) 


where f,,/,(-) 1s the conditional PDF of y[7] given s[n] and is known as the likelihood function. 
To implement ML detection we need to find an algorithm for solving the equation in (4.99). To 
make implementation easier, it often pays to simplify the formula. For the AWGN channel, “ 


conditional distribution of y[”] given s[n] = s 1s Gaussian with mean V Ex8 and variance 7 
Therefore, 


fyjs(yln||s{n] = 8) = fo(yln] — VExs) (4.100) 
= 5 € aT) . (4.101) 
107; 


The ML detector solves the optimization 


arg max f,,).(y[n]|s[n] = s) = arg min —;e o% (4.102) 
sEC * 


sEC 107, 


Note that the objective is to find the symbol s © C that minimizes the conditional likelihood, not the 
value of the conditional likelihood at the minimum value. This allows us to search for the minimizer 
and not the minimum and in turn simplify the optimization even further. We can neglect the scaling 
factor since it does not change minimizer s but only the value of the minimum: 


; yim} — v ‘Ex s|? ; 
arg max f,,)5(y|[n]|s[n] = s) = arg mine oF ; (4.103) 
sce" sEC | 


The In(-) function is a monotonically increasing function. Therefore, the minimizer of In(f(x)) is the 
same as the minimizer of f(x). Consequently, 


arg max In (f,).(y(n]|s[n] = s)) = arg max —|y[n] — V E,,s|?. (4.104) 
sec es sEC 


Instead of minimizing the negative of a function we maximize the function: 


arg max fyjs(y[n]|s[n] = s) = arg min |y[n] — V Exs|°. (4.105) 
a iad sEC 


This gives the desired result for ML detection in an AWGN channel. 


The final expression in (4.105) provides a simple form for the optimal ML detector: given an 
observation y[n], determine the transmitted symbol s © C, scaled by £,, that is closest to y[] in 


terms of the squared error or Euclidean distance. An example calculation of (4.105) is illustrated in 
Example 4.8. 


Example 4.8 


Consider a digital communication system that employs BPSK. Suppose that VEx = 2 Symbol 
sequences s[0] = 1 and s[1] =—1 are transmitted through an AWGN channel and the received signals 
are r[0] =-0.3 + 0.1) and 7[1] =—0.1 — 0.4). Assuming ML detection, what will be the detected 


sequence 8|N) at the receiver? 
Answer: The detection problem involves solving (4.105). For this purpose, we make the following 
calculations to find $1} first by hypothesizing that s[0] =—1, 


Ir(o) — 2|7 = | —0.34+0.1j — 2|? (4.106) 
= 4.42, (4.107) 


and then by hypothesizing that s[0] = 1: 


\r[0] + 2|? = | —0.34 0.1j + 2? (4.108) 

=i. es (4.109) 

Since 3.62 < 4.42, it follows that 8|0] = —lis the most likely symbol. Repeating the calculations to 
find §|1] 

\r[1] — 2/? =| —0.1 — 0.4j — 2/? (4.110) 

= 4.57, (4.111) 


and then by hypothesizing that s[1] = 1: 


lr(1] + 2|? = | —0.1 — 0.4j + 2? (4.112) 
= 3.77. (4.113) 
Since 3.77 < 4.57, it follows that s|1] = —lis the most likely symbol. During the detection process, 


the detector makes an error in detecting s[0] since s|0] £ s|0] 


In general, an ML detector for a linear modulation scheme like the one we have considered solves 
a detection problem by computing the squared distance between the scaled constellation point and the 
observation y[n]. For some constellations, the decision can be further simplified by exploiting 
symmetry and structure in the constellation, as shown in Example 4.9. 


Example 4.9 


Show how to simplify the detector for BPSK by exploiting the fact that for s © C, s is real and |s|* = 
i. 


Answer: Starting with the argument of the minimization in (4.105) and expanding: 


arg min ly[n] — //Exs|? = arg min ly[n]|* + |Bs|* — 2Re Ly" In] VExs| (4.114) 
= arg min 1 |y[n]]? + |B,|? — 2Re ly" [n]\/ Es 3 (4.115) 
= arg max Re [y*[n|s! (4.116) 
= arg may sRe [y*[n]] . (4.117) 


The first step follows by expanding the square. The second step follows from |s| = 1. The third step 
neglects constants that do not affect the argument minimization and removes the negative sine. The 
fourth step recognizes that s is real and can be pulled out of the real operation. The final detector 
simply computes Re[y[m]] and —Re[y[]] and chooses the largest value. Similar simplifications are 
possible for M-QAM, M-PAM, and M-PSK signals. 


A concept that helps both in the analysis of the detector and in simplifying the detection process is 
the Voronoi region. Consider symbol s, © C. The set of all possible observed y that will be detected 


as Sy is known as the Voronoi region or Voronoi cell for s. Mathematically it is written as 


 — i: ly- V/ Ey.8¢ ° < y- \/ Ey Sk af 


se,s,p EC, kF eS : (4.118) 


The union of all such cells is the entire complex space. 


Example 4.10 
Compute and plot the Voronoi regions for 4-QAM. 
Answer: We compute the Voronoi region by determining the points where 


2 2 
ly — VExsk| — |y— VExse| > 0. Expanding the squares along the lines of (4.115), exploiting 
the fact that |s,|? = |s/|* = 1, and canceling terms: 


ly — V Ex 8k ; — ly — V Ex s¢ ; = 2Re Ly" VExs¢| — 2Re ly" VExsk | : (4.119) 


The factor of 2V £x can be canceled to leave 
Ve, = {y : Re [y* se] > Re [y*sx]}. (4.120) 


The regions can be calculated by recognizing that all the symbols have the form(+=1+)j) v2 for a 


normalized QPSK constellation. The V2 can also be canceled and the values of £1 and +) used to 
further simplify Re [y*s/] = Re[y]Re[s] + Im[y]Im[s]. Consider the point (1 + j)/ V2. For it to be 
larger than (1 — j)/V2, [l= j)/V2 and (—1 + j)/V2 it must be true that 


Re[y] + Im[y] > Re[y] — Im[y] (4.121) 
Re[y] + Im{y] > —Re[y] — Im|[y! (4.122) 
Rely] + Im[y] > —Re[y] + Iml[y]. (4.123) 
Combining 
Im[y] > 0 (4.124) 
Rely] + Im[y] > 0 (4.125) 
Rely] > 0 (4.126) 


where the second equality is redundant with the other two gives the following simplified Voronoi 


regions for 4-QAM: 


Yo afialitaya { Yy 


Vasj/va = ty: Rely] > 0 and Im[y] > 0} (4.127) 
Va_jyva = ty: Rely] > 0 and Imly] < 0} (4.128) 
: Rely] <0 and Imly] > 0} (4.129) 

Rely] <0 and Imf{y] < 0}. (4.130) 


V(-1-jy/va = TY: 


The Voronoi regions for the 4-QAM constellation are illustrated in Figure 4.8. The four Voronoi 
regions are the four quadrants. Based on this result, a simplified ML detector for 4-QAM can simply 
compute the sign of Re[y[7]] and Im[y[7]], which determines the quadrant and thus the corresponding 


closest symbol. 


Figure 4.8 Voronoi regions for the 4-QAM constellation are the quadrants of the complex plane. 
Any point y[m] that falls ina Voronoi region is mapped to the corresponding symbol that generates 
that region through the ML detection process. 


4.4.5 Probability of Symbol Error Analysis 


The performance of a detector is measured by the probability that it makes an error. In Section 4.4.4, 
we derived the maximum likelihood symbol detector. Consequently, in this section we derive the 
natural measure of performance for this detector: the probability of symbol error, also known as the 
symbol error rate. The field of communication theory is especially concerned with computing the 
probability of error for different detectors. 


The probability of error for a detector depends on the probabilistic assumptions made about the 
channel. Different kinds of noise distributions, for example, lead to different detectors and thus 
different probabilities of symbol error expressions. In this section, we derive the maximum likelihood 


Ex 
detector for the AWGN channel: (2). tt turns out that for the AWGN channel, the probability of 
error is a function only of the SNR, which simplifies as £,/N,; consequently the notation reflects that 


fact. 
The probability of symbol poy is the expected value of the conditional probability of symbol 


EB. 
Pols (Ne) Fels (5 No ar is the probability that the detector makes an error givens, © C 
was transmitted. Assuming the symbols are equally likely, 


M-1 
P. (=) = a Palen (=). (4.131) 


m=U 


error where 


The conditional probability of symbol error is 


FE, Se 
P. —~|=P [Sm is detected incorrectly|s,, was transmitted] (4.132) 
e| Syn N « 
4.%O 
M-1 
—— P|s,, is decoded as se|s,, was transmitted,m £ &|. 4.133 
V1 ‘ 
e=0 
fm 


Computing the probability or error term in (4.133) requires an integration of the conditional 
probability distribution function f,),(x) over the corresponding Voronoi regions. 


The exact calculation of (4.133) is possible for some constellations, though it can be tedious [219]. 
As a consequence, we focus on calculating the union bound, which is a function of the pairwise error 
probability. Let P|Sm — Se] denote the probability that s,,, is decoded as s,, assuming that the 
constellation consists of two symbols {5,,, sp?. This can be written as 


P [sm — se] =P | y — VEx8m ; > ly — VExse : (4.134) 
For complex AWGN, it can be shown that [73] 
P [sm — se] = Q / = [Sm 5 sel" (4.135) 
where 
Q(x) ” ent) 2¢ (4.136) 


=> 27 


is the Gaussian Q-function. While Q(x) is usually tabulated or computed numerically, the Chernoff 
bound is sometimes useful for providing some intuition on the behavior of Q(x): 


Q(x) < 56" ee (4.137) 


Notice that O(x) decreases exponentially as a function of x. This is the reason that the Q-function is 
usually plotted on a log-log scale. 


The pairwise error probability provides an upper bound on the conditional error probability, 


as transmitted,m #1] < Pls, — sg], (4.138) 


P|s is decoded as s 


because the Voronoi regions for a two-symbol constellation are the same size as or larger than the 
Voronoi regions for a larger constellation. Consequently, this is a pessimistic evaluation of the 
probability that s,, is decoded as s,; thus the reason for the upper bound. 


Substituting (4.138) into (4.133), 


M-1 
Fy 
Pele ( : ) < oF P [Sm —> Spl, (4.139) 


and then using (4.131), 


M-1 M-1 E E g 2 
‘x }om — Pf } Av 
Pe GE oT >». », @ an wa ( (4.140) 


m=0 £=0,£4m 


Now define the minimum distance of the constellation as 


d? 


min ~ 


min |$¢ — Sm|?. (4.141) 
seEC, 8m EC .Sm FS8¢ 


The minimum distance is the quantity that characterizes the quality of a constellation and must be 
computed with the normalized constellation. The minimum distance provides a lower bound 


d= se—sul" (4.142) 


min 


for any distinct pair of symbols s, and s,,,. Since the Q-function monotonoically decreases as a 


function of its argument, a lower bound on the argument can be used to derive an upper bound on the 
probability of error: 


E, a2 
7 SS Suu (4.143) 


/\ 


E. M-1 
P, Gas: Sag 2 te 


m=0 


BE, d2, 
Simin) (4.144) 


=(M=1)0)| ig a 


The final expression in (4.144) is the union bound on the probability of symbol error for the 
constellation C. 


Example 4.11 
Compute the union upper bound for /-QAM given that for the normalized constellation 


6 | 
Pe pps, (4.145) 


min M eae 1 
(4.146) 


Answer: Substituting into (4.144): 


: E ) 
PQAM (=F) <(M-1)Q ( 


N,M-1 


Typical values of the probability of symbol error are in the range of 107! to 10~+. To represent the 
values correctly, it is common to plot the probability of symbol error on a log scale. The values of 
SNR that are of interest range from around 1 to 1000. To capture this range, it is common for SNR to 
be calculated and plotted in its decibel form where SNR dB = 10 log;,(SNR). Since double the SNR 
10 log;o(2SNR) ~ 3dB+10 logig(SNR), doubling £, leads to a 3dB increase in the SNR. The 


probability of error for M-QAM is examined in Example 4.12 and plotted in Figure 4.9. 


10° 
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SNR (dB) 
Figure 4.9 Probability of symbol error for /-QAM for different choices of M. Solid lines 
correspond to the exact probability of symbol error from (4.147). Dashed lines correspond to the 
union bound computed using. (4.144) 


Example 4.12 


Plot the union upper bound on the probability of symbol error and compare with the exact solution 
from [73] and given by 


nam ( Es [E, 3 
pOaM | —* | =4(1- =) Q | 4/——— 
; No JM) °\V No M—1 
i \ [E, = 3 . | 
~4{ 1-7) JQ ae 4.147 
( a) aly ew}| ( ’) 


for M=4, M= 16, and M = 64. Plot using SNR in decibels and plot using a semilogy axis. 


Answer: The plots are found in Figure 4.9. The union bound provides a reasonable approximation 
of the probability of symbol error at high SNR but is quite loose at low SNR. The reason is that 
nearest-neighbor error events dominate at high SNR but not at low SNR. The behavior of the exact 
and upper bounds is consistent at high SNR. Comparing the different modulation orders, we see that 

» { Ex 
for low values of “~ \ Ne ) for example, 10~°, there is approximately a 6dB SNR gap between the 
curves. This means that to achieve the same symbol error rate performance, 16-QAM needs four 
times more power since 10 log),9 6 ~ 6dB. For a fixed SNR, the symbol error rate is larger for larger 


values of M. The reason is that the constellation has more points; thus the minimum distance between 
points is smaller. 


The final derivations are a function of SNR = £,/N,, the size of the constellation, and the minimum 


distance of the constellation. For a given constellation, to reduce the probability of error and increase 
system performance, either E, must be increased or N,, must be reduced. The average signal power F, 


can be increased by using more transmit power, though not without bound. The maximum power is 
constrained due to health concerns in most wireless systems. Because of path loss and fading in the 
channel, as discussed in Chapter 5, the signal attenuates with distance. As a result, the SNR measured 
at the receiver becomes the appropriate performance measure, accounting for losses in the channel. 
This can be increased by reducing the distance between the transmitter and the receiver (this will be 
clearer after the discussion of path loss). The effective noise power density is N, = AT, where T, is 
the effective noise temperature of the device. The noise temperature can be reduced to some extent by 
changing the RF design, providing better cooling, or using higher-quality components [271]. 

For a fixed SNR, the constellation can be changed to improve the probability of symbol error. For 
example, 16-QAM has a higher d,,,. than 16-PSK because it achieves a better packing of points. 
Because of the constellation normalization, there are limited gains from going to more elaborate 
constellations than QAM. Alternatively, the number of points in the constellation may be reduced. 
Then the remaining points can be spaced farther apart. This is evident when looking at d,,,,, for V/- 
QAM in (4.145). Unfortunately, reducing the number of points also reduces the number of bits per 
symbol. In systems that support link adaptation, the constellation size is varied adaptively to achieve 
a particular target probability of symbol error, for example, 10-2, as the SNR changes. The 
application usually dictates a target probability of error. This is usually the probability of bit error 


after error control decoding, but it can be translated into an effective probability of bit or symbol 
error before decoding. 


For many systems, especially with more elaborate impairments, it is difficult to calculate the exact 
probability of symbol error. An alternative is to use Monte Carlo simulation. While it seems ad hoc, 
this approach is commonly used to evaluate the true error performance of complex systems. A Monte 
Carlo approach for estimating the probability of symbol error directly from the discrete-time input- 
output relationship is as follows.! First, choose a number of iterations N. Forn=0,1,...,N—-1, 
generate a realization of a symbol s[n] by choosing a symbol uniformly from the constellation C. 
Generate a realization of complex noise v[] from a Gaussian distribution. Most numerical software 
packages have Gaussian random number generators for N (0, 1). You can generate complex noise with 


variance N,, as v[n] = VNo/2(% + jy) where x and y are generated from N(0, 1). Next, form 
y[n] = VExs[n] +v[n] and pass y[n] to an ML detector. If the output 8|n] is different from s[n], 
count it as an error. The probability of symbol error is then estimated as 


1. To simulate more complex receiver impairments, as in Chapter 5, a baseband simulation with the continuous-time waveforms may 
be required. 
~ { hy # errors 
x wT ; f > 
P,{—}|= ——, (4.148) 
No N 


To get a good estimate, a rule of thumb is that NV must be chosen such that at least ten errors are 
observed. It is often useful to compare the final result with the upper bound in (4.144) as a sanity 
check, to make sure that the estimated probability of error is below the theoretical upper bound. 


4.5 Digital Implementation of Pulse Shaping 


In this section, we explain a more practical implementation of transmit pulse shaping and receive 
filtering. The system in Figure 4.6 implements transmit pulse shaping and receive filtering somewhat 
idealistically in analog. This structure does not map to a realizable implementation for several 
reasons. First, there is a mix of analog and digital implementations without an explicit discrete-to- 
continuous conversion at the transmitter. Second, the structure does not allow for the possibility of 
performing transmit and receive pulse shaping entirely in the digital domain. Pulse shaping for digital 
can be an advantage since it is easier to implement than analog pulse shaping and 1s flexible since it 
can be combined with other receiver processing. In this section, we explain how to implement pulse 
shaping at the transmitter and matched filtering at the receiver using multirate signal processing 
concepts from Section 3.4. 


4.5.1 Transmit Pulse Shaping 
In this section we develop an approach for generating x(f) in discrete time, using the fact that when 
the pulse shaping is bandlimited, it follows that x(t) is bandlimted. 

We need to be careful to remember that 1/7 1s the symbol rate and is not necessarily the Nyquist 
rate. For example, for a square root raised cosine pulse shape, the Nyquist rate 1s (1 + a)/7, which is 
greater than 1/7 when there is excess bandwidth, that is, a > 0. For this reason, we need multirate 
signal processing from Section 3.4 to change between the symbol rate and the sample rate. 


We focus on generating the transmit signal prior to scaling by VEx as given by 


OO 


A > s[n]gtx(t — nT). (4.149) 


m=—— 


This is reasonable because the scaling by VEx is usually applied by the analog front end. 

Since x(t) is bandlimited by virtue of the bandlimited pulse shape g,,(7), there exists a sampling 
period 7, such that 1/7, is greater than twice the maximum frequency of g,,(7). According to the 
reconstruction equation in the Nyquist sampling theorem, 


Z(t) = > Z[n]sine((t — nTx)/Tx) (4.150) 
where 
ain) = (nT) (4.151) 
- 
— > s|m]gtx(nT, — mT). (4.152) 
m=—-— OO 


It remains now to create the convolution sum in discrete time. 
Now suppose that 7;, = 7/M,, for some positive integer M,,, where M,, is the oversampling factor. 


If the available sampling rate of the discrete-to-continuous converter does not satisfy this property, 
the results of this section can be followed by the sample rate conversion in Chapter 3. Let 


Itx|N] = 9tx(NTx) (4.153) 
be the oversampled transmit pulse-shaping filter. Then 


XO 


£\ra| = >. s[m]gtx{n — Mixm]. (4.154) 


m=—co 


Recognizing the structure in (4.154), using (3.395), 


z[n| = ( > s|m]d(n — Mon) * Otx|n| (4.155) 


m=— co 


where the first term corresponds to s[m] upsampled by M,,. Recall from Chapter 3 that upsampling by 
M,,, corresponds to inserting M,, — 1 zeros after every s[n]. With upsampling, a discrete-time 


implementation of the transmitter takes on the simple form as illustrated in Figure 4.10; recall that 
tM, 1s the block diagram notation for upsampling. The symbol sequence is upsampled by M,, and 


filtered with the oversampled transmit pulse shape prior to being sent to the discrete-to-continuous 
converter. This allows pulse shaping to be implemented completely in digital and makes the 


generation of x(t) practical. 


s[n] &x[1] D/C x(t) 


Figure 4.10 An implementation of transmit pulse shaping with a combination of upsampling and 
filtering 


The complexity of (4.155) can be reduced through application of multirate identities from Section 
3.4.4. Complexity is a concern because the convolution after upsampling does not exploit the many 


zeros after the upsampling. A more efficient approach is to use a filter bank. Define the C polyphase 
component of T|N] as 
tn] = t[nMi. + (4.156) 
o¢ 
— ia s[m] gtx [rn Mix + € — mM]. (4.157) 
m=-— Co 


i similar way, define the polyphase components of the transmit pulse-shaping filter 


Ix ‘[n] = = gtx(nMix + €) Using this definition, it is possible to build the 0" subsequence as a 
convolution between the symbol stream {s[n]} and the subsampled transmit filters: 


le ©) 
ln] = oD sim] gi? [n —m). (4.158) 
To reconstruct “|”, let EO) (n} he F [nn] upsampled by M,, using the notation in (3.395). Then 


Mtx-1 


EnjJ= >, 2[fn—4. (4.159) 


=0 
Substituting (4.157) and recognizing that E[n] = dop=—oo Z[k]6[n — kM tx! then 


Mix-1 co 
— a 2 (> ar sim] gi? [k — 2) é[n — kM — €]. (4.160) 


£=0 k=- m=—oo 


py oe the symbol stream s[7] is first convolved by each polyphase filter Is | "| The length of 


(2) 
I x a lis approximately M,, times smaller than 9; x ‘fr a} thus resulting in a complexity reduction. Then 


the different filtered sequences are upsampled and delayed to create the desired output ” t[n|, This 
filter bank is illustrated in Figure 4.11. 


D/C E> x(1) 


Figure 4.11 A lower-complexity implementation of pulse shaping. We use the notation z! to 
denote a delay of one sample. 


4.5.2 Receiver Matched Filtering 


In this section, we develop an approach for performing matched filtering in discrete time. Unlike the 
transmitter, the receiver signal processing in Figure 4.6 can be performed using realistic hardware. A 
main disadvantage, though, is that it requires an analog implementation of the matched filter. Usually 
this involves solving a filter design problem to approximate the response of g.,(t) with available 


hardware components. Digital implementation of matched filtering leads to a simpler analog design 
and a more flexible receiver implementation. For example, with a digital implementation only a 
lowpass filter is required prior to sampling. Analog lowpass filters are widely available. It also 
makes the receiver operation more flexible, making it easier to adapt to waveforms with different 
bandwidths. Finally, 1t makes other functions like symbol synchronization easier to implement. 


Our approach 1s based on oversampling the received signal. Let us denote the received signal prior 
to matched filtering as 7(t). We assume that 7(t) has already been filtered so that it can be treated as a 
bandlimited complex baseband signal. Let T, = 7/M,. for some positive integer M/,, such that 1/7, is 
greater than the Nyquist rate of the signal. Let r[m] = r(nT,). From the results in Chapter 3, since r(f) 
is bandlimited, it is possible to replace the continuous-time matched filter in Figure 4.6 witha 
continuous-to-discrete converter, a discrete-time filter, and a discrete-time continuous converter as 
shown in Figure 4.12. 


r(t) “a g |n| eee 


= MT, 


Figure 4.12 An en of receive matched Miss in continuous time using discrete-time 
processing 


As the pulse shape is already bandlimited, and thus does not need to be lowpass filtered, the 
discrete-time filter 1s just the scaled oversampled filter 


Reel] — Fy 65( NT, ). (4.161) 
The scaling by 7, can be neglected since it does not impact the SNR. 
The combination of discrete-to-continuous conversion with 7. followed by continuous-to-discrete 


conversion 7M. can be further simplified. In fact, this is just downsampling by M.. This leads to a 
simplified system as illustrated in Figure 4.13. Mathematically, the output can be written as 


y[n] = we Orx(k]r[nM,x — kl}. (4.162) 


k=—00 
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I; 
Figure 4.13 An implementation of receive matched filtering in continuous time using discrete-time 
processing after simplifying 
The system in Figure 4.13 convolves the sampled received signal by g,.,[7], then downsamples by /,, 
to produce y[n]. 


The computational complexity of Figure 4.13 can be further reduced by recognizing that many 
computations are performed to obtain samples that are subsequently discarded by the downsampling 
operation. This is not inefficient. As an alternative, consider the application of some multirate signal 
processing identities. First note that 


y[n] = rk] grx(n Mix — kl (4.163) 


r(pMrx iE m|9rx [nMrx — pM;x — m| (4.164) 


Myx—-1 oo 

= >! rf] grx{[Mrx(n — p) — m] (4.165) 
m=0 p=-co 
Mrx-1 00 

= a. ¥ r™ 19) grx(Mrx(n — p — 1) + Mex — ml] (4.166) 
m=0 p=-c 
Myx-1 «x 


PM) fy] g Mrx—™) In, — p — 1] (4.167) 
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fe) — 7) tr 
where gix” a lis the m! h polyphase component of g,,[n] (with gi lh ] = grx’ |? r}) and r)[n] is 


the m' polyphase component of r[7]. An implementation of this alternative structure is presented in 


Figure 4.14. While it does not look neat, counting the number of additions and multiplications reveals 
that it is a far more efficient implementation. 
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Figure 4.14 Exchange of downsampling and filtering operations 


4.6 Summary 


¢ The complex pulse-amplitude modulator involves bit-to-symbol mapping, pulse shaping, and 
gain. Normalizations of the constellation and transmit pulse shape are required so that the 
power of the transmitted signal comes only from the added gain. 


¢ The optimal receiver for the AWGN channel involves filtering and symbol rate sampling. To 
maximize the received SINR, the receive filter should be matched to the transmit pulse shape. 
The combined transmit pulse shape and matched filter should be a Nyquist pulse shape, to 
satisfy the zero-ISI condition. 


¢ Maximum likelihood detection of symbols in an AWGN channel involves finding the scaled 
symbol in the constellation that is closest to the observed sample. The detection is applied 
separately on each symbol thanks to the Nyquist pulse shape. Sometimes the detection algorithm 
can be simplified by recognizing symmetries and structure in the constellation. 


¢ The performance of a maximum likelihood symbol detector is evaluated through the probability 
of symbol error, which is a function of the SNR and the constellation. Since it is hard to 
compute directly, it is often useful to compute the union upper bound, which depends on the 
SNR, the minimum distance of the constellation, and the number of points. The union bound is 
loose except at high SNR. 


¢ Pulse shaping can be performed in discrete time at the transmitter, through a combination of 
upsampling and filtering with the oversampled pulse shape. The derivation is based on the 
Nyquist reconstruction formula. Complexity can be reduced by using a filter-bank structure that 
performs filtering prior to upsampling. 


¢ Matched filtering can be performed in discrete time at the receiver through a combination of 
filtering and downsampling. The derivation uses results on discrete-time processing of 
continuous-time bandlimited signals. Complexity can be further reduced using a filter-bank 
structure, where the filtering happens after the downsampling. 


Problems 


1. Mr. Q. A. Monson has come up with a new M = 8-point constellation, illustrated in Figure 
4.15. 


(a) Compute £, for this constellation, assuming all symbols are equally likely. 
(b) Compute the scaling factor that makes the constellation in Figure 4.15 have EF, = 1. 
(c) Plot the constellation diagram for the normalized constellation. 


(d) Determine a bit-to-symbol mapping and add it to your constellation plot. Argue why your 
mapping makes sense. 


lm 
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Figure 4.15 An 8-point constellation 
2. Consider a general complex pulse-amplitude modulation scheme with transmitted signal x(t) = 
dn Sin|g(t — nT,) where the pulse shape g(t) = 4 T et the constellation be C = {2, —2, 2j, — 

24, 0} 

(a) What is the appropriate matched filter to apply to this signal? 

(b) What is the average energy of the constellation? 

(c) Suppose that you generate complex Gaussian noise by generating the real and imaginary 
components separately with distribution M(0, 1). How much do you need to scale the 
transmit constellation to achieve a signal-to-noise ratio of 10dB? 

3. Consider a digital communication system that uses Gaussian pulses g,,(¢) = a exp(—ma’Z*) and 
£ix(t) = g,,(t). Let T be the symbol interval. Let g(t) = g,,(2) * 2g,,(2). 

(a) Find the value of o such that the pulse shape satisfies the unit norm property. Use the 
normalized pulse shape for the rest of the problem. 

(b) Prove that Gaussian pulse shapes do not satisfy the zero-ISI criterion. 

(c) Suppose that a is chosen such that g(7) = 0.001. Find a. 

(d) Given that G(/) is the Fourier transform of g(t), define the bandwidth W of g(t) such that 


G(W)/G(0) = 0.001. Find the value of W for the value of a given previously. How does this 
value compare to that obtained froma raised cosine filter with 100% rolloff? 


4. Let 
th sin(3rt/T" . _ _ 
git) =3F a] cos(3rt/T’) (4.168) 


and 
g|n| = ga(nT) (4.169) 


where g,(t) 1s the normalized version of g(t) with unit energy. 


(a) Compute the normalized pulse shape g,(t). 

(b) Compute g[7]. 

(c) Compute Gera ). 

(d) Is g[n] a Nyquist pulse shape? Please justify your answer. 


5. Consider a QAM communication system that employs raised cosine pulse shaping at the 
transmitter and a carrier frequency of 2GHz. In this problem you will answer various questions 
about the relationships between symbol rates and bandwidths. 


(a) Suppose that the symbol rate is 1 megasymbol per second and that the raised cosine has 
50% excess bandwidth. What is the baseband absolute bandwidth? 


(b) Suppose that the symbol rate 1s 1 megasymbol per second and that the raised cosine has 
50% excess bandwidth. What is the passband absolute bandwidth? 


(c) Suppose that the absolute bandwidth of a signal is 1MHz at baseband. What is the best 
symbol rate that can be achieved with a raised cosine pulse? Hint: You can choose any 
raised cosine pulse. 


(d) Suppose that you have a raised cosine pulse shape with absolute bandwidth of 4MHz at 
baseband and excess bandwidth of 50%. What is the symbol rate? 


(ec) Suppose that the absolute bandwidth of your modulated signal is 3MHz at baseband with an 
excess bandwidth of 25%. What is the Nyquist frequency? What is the Nyquist rate? 


6. Consider a variation of problem 5 with a QAM communication system that employs raised 
cosine pulse shaping at the transmitter and a carrier frequency of 3GHz. In this problem you 
will answer various questions about the relationships between symbol rates and bandwidths. 
(a) Suppose that the symbol rate is 1 megasymbol per second and that the raised cosine has 

50% excess bandwidth. What is the baseband absolute bandwidth? 


(b) Suppose that the symbol rate is 1 megasymbol per second and that the raised cosine has 
50% excess bandwidth. What is the passband absolute bandwidth? 


(c) Suppose that the absolute bandwidth of a signal is 1MHz at baseband. What is the maximum 
symbol rate that can be achieved with a raised cosine pulse? 


(d) Suppose that you have a raised cosine pulse shape with absolute bandwidth of 2MHz at 
baseband and excesss bandwidth of 50%. What is the symbol rate? 


(e) Suppose that the absolute bandwidth of your modulated signal is 3MHz at baseband with an 
excess bandwidth of 25%. What is the Nyquist rate? What is the sampling rate at the 
receiver? 

7. Consider a QAM system with symbol rate 1/7 and transmit pulse-shaping filter g,,(2). Suppose 
that the D/C operates at a rate of L/MT where L and M are coprime integers, L > M, and L/MT 
is greater than the Nyquist rate of g,,(Z). 

(a) Determine a straightforward implementation of transmit pulse shaping using the theory of 
multirate signal processing. Find the coefficients of the filters and provide a block diagram 
of the transmitter. 

(b) Using the upsampling and downsampling identities, interchange the upsampling and 
downsampling operations to create an alternative implementation. Find the coefficients of 
the filters and provide a block diagram of the transmitter. 

(c) Which approach is more computationally efficient? 


8. Consider a complex pulse-amplitude modulated digital communication system. Suppose that a 
square root raised cosine is used for the transmit pulse shape and received matched filter with 


50% excess bandwidth and a symbol rate of 1 megasymbol per second. 

(a) Suppose that you employ the structure in Figure 4.10 to implement the pulse shaping at the 
transmitter. Determine the smallest value of tx © Z” such that 7, = 7/M,, will still result 
in perfect reconstruction. 

(b) Determine the coefficients of the finite impulse response filter {Gtx [A bee ~K such that 
2txLk] has 99% of the energy of g,,(7). Essentially choose K, then find g,,[A]. This is an 
example of creating a practical approximation of an infinite-length filter. 

(c) Determine the complexity per symbol required for the filter computations, assuming the 
filter is implemented directly in the time domain. 

(d) Now suppose that you employ a filter bank to implement pulse shaping at the transmitter as 


(m)rz7 : 
in Figure 4.11. Determine the values of 9tx if] given the M,, and g,,[A] you determined 


previously. 
(ec) Determine the complexity per symbol required for the FIR (finite impulse response) filter 
computations, assuming the filters are implemented directly in the time domain. 
(f) Explain the advantage of the filter-bank approach. 
9. Prove that the cascade of a discrete-to-continuous converter operating at a rate of f, followed 
by a continuous-to-discrete converter operating at a rate of f,,, is equivalent to downsampling 


by M as shown in Figure 4.16. 


Figure 4.16 See problem 9 


10. Prove that the zero-ISI property is maintained in the architecture in Figure 4.13 assuming that 
2,x(t) is matched to g,,(t), the composite filter g(t) = g,,(t)*g,,(t) 18 a Nyquist pulse shape, and 


T= T/M,,. In other words, show that y\n| = VExs{n] in the absence of noise. 


11. Consider the problem of designing an approximate linear interpolator. Suppose that x[n] are 
samples froma bandlimited continuous-time function, sampled at the Nyquist rate. Now 
suppose that we want to obtain samples y[n] that were taken with twice the frequency. In other 
words, if the sampling period is T and x[n] = x(nT), then the interpolated function is y[n] = 
x(nT/2). 

(a) First consider optimum interpolation. 
1. Draw the block diagram for the ideal interpolator. The input is x[”] and the output is y[7]. 
Hint: It involves an upsampling by 2 followed by a filter and uses some multirate signal 
processing from Chapter 3. 


1. Write an equation for y[2n] and y[2n + 1]. Simplify to the extent possible. Hint: y[2n] is 
very simple. 

iii. Draw a block diagram that shows how to implement the ideal interpolator using filtering 
before the upsampling. Again, this uses multirate signal processing results from Chapter 
S, 

(b) The computation of y[2” + 1] 1s computationally expensive. Suppose that we want to 
implement a simple averaging instead. Suppose that we average the two adjacent samples to 
perform the interpolation. Let the coefficients of the filter be {i[0], A[1]}, and let 


y[2n + 1] =h[O]y[2n — 2] + Afljy[2n + 2] (4.170) 


be our low-complexity estimate of y[2n+1]. Suppose that you have observations {y[r] } mt 
enough to compute N errors. To be precise, assuming K is odd, then we need K = 2N + 3 to 
compute N errors. Let e[n] = y[2n + 1] — (A[O]y[2n —2] + Al 1]y[2n + 2]) be the error. The 
goal in the remainder of this problem is to use least squares methods to estimate {h[0], A[1]} 


N-1, 2 
such that 2n=o |€l]|" is minimized. 
i. Assuming that A[ 1] = 0, find the least squares estimate of [0] by expanding the sum, 
differentiating, and solving. 
1. Assuming that [0] = 0, find the least squares estimate of h[1] by expanding the sum, 


differentiating, and solving. 


iii. Find the least squares estimate of h[0] and A[1] by expanding the sum, differentiating, 
and solving. 


iv. Formulate and solve the least squares problem for finding A[0] and A[1] in matrix form. 
Note that because of the matrix dimensions, you can find an expression for the inverse. 

v. Now rewrite the solution as a function of estimates of the autocorrelation function R,,[k] 
and interpret your answer. 


12. Suppose that we can model the received signal using the following equation, 
y=xrt+u, (4.171) 


where x is generated uniformly froma 16-QAM constellation (that has been normalized to have 
FE, = 1) and v is distributed as N.(0, 0.025). Note that x and v are independent random 
variables. 


(a) What are the mean and variance of x? 
(b) What are the mean and variance of Re[v]? 
(c) What are the mean and variance of v? 
(d) What are the mean and variance of y? 


(e) We define the SNR as E(\2|"] /E{|v|?] what is the SNR in this case? 
13. Consider an AWGN equivalent system model where 


where s is a symbol from constellation C= {s,, s} and v is AWGN with N (0, N,). 


(a) Derive the probability of symbol error. Essentially, you need to derive P[si- S2\ not use 
the result given in the book. Of course, you can check your result with the book. 
(b) Suppose that C= {1+), -1-j}. Derive the zero-mean normalized constellation. 


(c) Using the normalized constellation, plot the probability of symbol error as a function of 
SNR. You should have SNR in decibels and use a semilogy plot. Also plot using the 
Chernoff upper bound on the Q(-) function. 

14. Consider the maximum likelihood detector for AWGN channels. Suppose there is a scalar 
channel a, potentially complex, that operates on the transmitted symbols x[”]. This is known as 

a flat-fading channel. In this case the signal at the receiver after sampling is 


y(n] = az|n] + v{[n] (4.173) 


where x[n] is a symbol chosen from constellation C and v[n] is AWGN with variance N,. Find 


the maximum likelihood detector given that you know a (which can be a random variable). 
Hint: Solve for arg max,ecfyy V1") k[n], «). 


15. Recall the derivation of the maximum likelihood detector for AWGN channels. Now suppose 
that v[7] is an IID Bernoulli random variable that takes value 0 with probability p and value 1 
with probability 1 — p. Consider the channel model given in (4.174) known as the binary 


symmetric channel, which uses addition modulo 2 (1.e., like an XOR function). Suppose the 
received signal is 


y = (s+ v)mod2, (4.174) 
where s is a symbol chosen from a constellation C = 0, 1. Find the maximum likelihood 


detector. Since the noise is a discrete random variable, we formulate the maximum likelihood 
problem as 


arg max P,\s[y|s = 5] (4.175) 
where P,,[y|s = S] is the conditional probability mass function. Hint: (a) Solve for the 
likelihood function P,,.[y|s = S]. (b) Find the detection rule for 0 and | that maximizes the 
likelihood; that 1s, choose 0 if P,).[y|s = 0] = Py Lys = 1]. 

16. This problem considers an abstraction of a visible light communication system. 


(a) Consider the BPSK modulation format with constellation {1,—1}. Because we need to keep 
the light on while transmitting, we want to create a nonzero-mean constellation. Create a 
modified constellation such that the mean is 5. 


(b) Let s,; denote the scaled | symbol and let s, denote the scaled —1. Suppose that the noise is 
signal dependent. In particular, suppose that ifs, is sent, 


y= sy, + V1 (4.176) 


where v, is N(0, 67), and if s, is sent, 
y = 89 + U2 (4.177) 


where v, is N(0, 207). 
Determine the maximum likelihood detection rule. Specifically, come up with an algorithm 


or formula that takes the observation y and produces as an output either s, or s>. 


17. Consider a digital communication system that sends QAM data over a voice-band telephone 
link with a rate of 1200 symbols/second. Assume that the data 1s corrupted by AWGN. Find the 
SNR that is required, using both the union bound and the exact error expressions, to obtain a 
symbol error probability of 10~* if the bit rate is: 

(a) 2400 bits/sec 
(b) 4800 bits/sec 
(c) 9600 bits/sec 


18. Consider a modulation scheme with M = 3 and the following constellation: 
C={1+j,1+4j,1+ 77}. (4.178) 


(a) Plot the constellation. 
(b) Find and plot the equivalent zero-mean constellation. 


(c) Find and plot the equivalent unit energy constellation. 
(d) What is the normalized constellation’s minimum distance? 
(e) Find the union bound on the probability of symbol error as a function of SNR. 


19. Suppose that we use the following constellation in a general pulse-amplitude modulation 
system: C= {0, 2, 27, —2}. 


(a) Determine the mean of the constellation. Construct the zero-mean constellation. 


(b) Determine the appropriate scaling factor to make the constellation unit energy. Construct the 
zero-mean normalized constellation. 


(c) What is the minimum distance of the zero-mean unit norm constellation? 
(d) Plot this constellation and sketch the Voronoi regions. 


(e) What is the union bound on the probability of error for this constellation assuming an 


AWGN channel? 
20. Consider a modulation scheme with M = 3 and complex constellation 
C = {1,cos(z7/12) + jsin(77/12), cos(#11/12) + jsin(711/12)}. (4.179) 


(a) Normalize the constellation and plot it. Use the normalized constellation for all subsequent 
parts of this problem. 


(b) Find the minimum distance of the constellation. 
(c) Find the union bound on the probability of symbol error as a function of the SNR in an 
AWGN channel. You may assume AWGN with unit variance (1.e., SNR = £,/1). 


(d) Plot the union bound on a semilogy graph. Make sure that SNR is in decibels. You will need 
to figure out how to compute the Q-function (you can use the erfc function in part) and how 
to reverse this process. 


21. Consider a modulation scheme with M = 8 and the complex constellation 


C= 40,4, 454 — Aj, 2-2), 8 — Fj, 9 4945 (4.180) 


and pulse-shaping filter 
9tx,non(t) = sinc?(t/T) (4.181) 


where 7 is the symbol period. 
(a) Determine the mean of the constellation, assuming that each symbol is equally likely. In 


other words, compute / = E,[s[r il] where s[n] is the complex symbol randomly chosen 
from the constellation. 


(b) Find the zero-mean constellation, obtained by removing the mean from C. 
(c) Determine the average energy of the zero-mean constellation £,. 
(d) We need to ensure that the constellation is zero mean. Forma new constellation by 


removing the mean, which we refer to as the zero-mean constellation. Plot the resulting 
constellation on a real/imaginary plot. Do not forget to label properly. 


(e) Compute the scaling factor that makes the zero-mean constellation have unit average power. 


Plot the resulting constellation, which is in fact the normalized constellation with zero mean 
and unit average power. Do not forget to label properly. 


(f) Compute the minimum distance of the normalized constellation, d,,i.. 
(g) Compute the normalized pulse-shaping filter g,,(¢) from gi non(t)- 
(h) Let the transmitted signal be 


20) = VE. >~ s{n]gtx(t — nT) (4.182) 


where s[] is an element of the zero-mean unit energy constellation. Determine the power 
spectral density of x(Z). 


(1) Determine the absolute bandwidth of x(7). 


(j) Determine the union bound on the probability of symbol error as a function of the SNR, that 
is, the ratio £,/N,. Plot the union bound on a semilogy graph. Make sure that the SNR is in 


decibels. Do not forget to label properly. 
22. Consider the transmission of the superposition of two independent messages x, and x, over an 
AWGN channel, where the received signal after sampling is given by 


y=2%1+22+, (4.183) 


where x, is chosen from the ternary constellation C; = {0, 1, 3}, with equal probability, x, is 

drawn independently of x, from the ternary constellation C, = {cj, c>, c3}, also with equal 

probability, and v is zero-mean AWGN with variance o?. 

(a) Choose the constellation points {c), c>, c3} of C, such that C1 1 C2 = ( and the 
constellation resulting from x = x, + x, 1s zero mean. 

(b) Find the normalized constellation of x; that is, find the constellation with unit average 
power. 

(c) Find the minimum distance of the normalized constellation. 

(d) Find the maximum likelihood (ML) detector for detecting x given the observation y. 

(ce) Draw the constellation of x and the Voronoi regions for the ML detector in part (d). 

23. Consider a modulation scheme with M = 4 and complex constellation 


C = {0,2,2j,1 + j}. (4.184) 


(a) We need to ensure that the constellation is zero mean. Determine the mean of the 
constellation assuming that each symbol is equally likely. In other words, compute 
1 = Es|s|n\| where x[n] is randomly chosen from the constellation. Form a new 
constellation by removing the mean. Plot the resulting constellation on a real/imaginary plot. 
(b) Given the modified constellation, with the mean removed, find the normalized constellation 
—that 1s, find the constellation with unit average power—and plot the resulting 
constellation. 


(c) Find the minimum distance of the normalized constellation. 


(d) Find the union bound on the probability of symbol error as a function of the SNR. 


(e) Compute the SNR such that your bound on the probability of symbol error is 10~>. You will 
need to figure out how to compute the Q-function (you can use the evfc function in part) and 
how to reverse this process. Explain how you got your answer. 


Ex _ : 
24. Consider a noise-free system transmitting QPSK symbols operating at Vu 10dB . The input 
to the symbol detector y[7] is given by 
y[n] = V Ex > s|m]g((n — m)T — ta) + v{n], (4.185) 


m 


where g(t) = 2,,(0) * h(D) * g,,(t) is the raised cosine pulse shape with 50% excess bandwidth, 
and where 0 < t, < Tis an unknown symbol delay. Also, let 


yo|n| = VEx > s|m|g((n — m)T) + v{n] (4.186) 
denote the ideal synchronized version. 
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(a) Find E|/y{n| — yolr]|"] as a function of ty. Plot for 0 <1, < 7 and interpret the results. 


(b) Treating the intersymbol interference due to Tp, as noise, using the exact QPSK probability of 
symbol error expression, plot the probability of error for t= 0, 0.257 0.57 0.75T. The x- 
axis of your plot should be the SNR in decibels and the y-axis should be the probability of 
symbol error on a semilogy curve. 

25. Computer Implement an ML detector for 4-QAM, 16-QAM, and 64-QAM. Use the procedure 
outlined in Section 4.4.5 where you generate Y|! ] = VExs{n] + v[n] and apply the detector to 
the result. Use the Monte Carlo estimation approach in (4.148) to estimate the error rate. Plot 
the theoretical curves in (4.147) and verify that they line up when enough symbols are used in 
the estimator. 

26. Computer Create a program that generates an M-QAM waveform. 

(a) Create a function to generate a block of M,;,,, each of which is equally likely. 

(b) Create a function that implements the bit-to-symbol mapping using the labelings in Figure 


4.3. Your code should support M = 4, M = 16, and M = 64. Be sure to use the normalized 
constellation. 


(c) Create a function to generate the oversampled square root raised cosine pulse shape. The 
amount of oversampling /,, should be a parameter. You should use a finite impulse response 
approximation with suitably chosen length. 

(d) Create a function to generate the scaled sampled waveform x[n] = x(n7T/M,,) using the 
functions you have created. 

(ec) Demonstrate that your function works by plotting the output for an input bit sequence 


00011110101101 with 4-QAM and a rolloff of a = 0.5. For your plot, choose V Ey = 5, 
Plot the real and imaginary outputs. 


Chapter 5. Dealing with Impairments 


Communication in wireless channels is complicated, with additional impairments beyond AWGN and 
more elaborate signal processing to deal with those impairments. This chapter develops more 
complete models for wireless communication links, including symbol timing offset, frame timing 
offset, frequency offset, flat fading, and frequency-selective fading. It also reviews propagation 
channel modeling, including large-scale fading, small-scale fading, channel selectivity, and typical 
channel models. 


We begin the chapter by examining the case of frequency-flat wireless channels, including topics 
such as the frequency-flat channel model, symbol synchronization, frame synchronization, channel 
estimation, equalization, and carrier frequency offset correction. Then we consider the ramifications 
of communication in considerably more challenging frequency-selective channels. We revisit each 
key impairment and present algorithms for estimating unknown parameters and removing their effects. 
To remove the effects of the channel, we consider several types of equalizers, including a least 
squares equalizer determined from a channel estimate, an equalizer directly determined from the 
unknown training, single-carrier frequency-domain equalization (SC-FDE), and orthogonal 
frequency-division multiplexing (OFDM). Since equalization requires an estimate of the channel, we 
also develop algorithms for channel estimation in both the time and frequency domains. Finally, we 
develop techniques for carrier frequency offset correction and frame synchronization. The key idea is 
to use a specially designed transmit signal to facilitate frequency offset estimation and frame 
synchronization prior to other functions like channel estimation and equalization. The approach of this 
chapter is to consider specific algorithmic solutions to these impairments rather than deriving optimal 
solutions. 


We conclude the chapter with an introduction to propagation channel models. Such models are used 
in the design, analysis, and simulation of communication systems. We begin by describing how to 
decompose a wireless channel model into two submodels: one based on large-scale variations and 
one on small-scale variations. We then introduce large-scale models for path loss, including the log- 
distance and LOS/NLOS channel models. Next, we describe the selectivity of a small-scale fading 
channel, explaining how to determine if it is frequency selective and how quickly it varies over time. 
Finally, we present several small-scale fading models for both flat and frequency-selective channels, 
including some analysis of the effects of fading on the average probability of symbol error. 


5.1 Frequency-Flat Wireless Channels 


In this section, we develop the frequency-flat AWGN communication model, using a single-path 
channel and the notion of the complex baseband equivalent. Then we introduce several impairments 
and explain how to correct them. Symbol synchronization corrects for not sampling at the correct 
point, which is also known as symbol timing offset. Frame synchronization finds a known reference 
point in the data—for example, the location of a training sequence—to overcome the problem of 
frame timing offset. Channel estimation is used to estimate the unknown flat-fading complex channel 
coefficient. With this estimate, equalization is used to remove the effects of the channel. Carrier 
frequency offset synchronization corrects for differences in the carrier frequencies between the 
transmitter and the receiver. This chapter provides a foundation for dealing with impairments in the 
more complicated case of frequency-selective channels. 


5.1.1 Discrete-Time Model for Frequency-Flat Fading 


The wireless communication channel, including all impairments, is not well modeled simply by 
AWGN. A more complete model also includes the effects of the propagation channel and filtering in 
the analog front end. In this section, we consider a single-path channel with impulse response 


h_(t) = a6(t — 7a). (5.1) 


Based on the derivations in Section 3.3.3 and Section 3.3.4, this channel has a complex baseband 
equivalent given by 


h(t) = Bae~!?"FeTginc( B(t — Tq)) (5.2) 
and pseudo-baseband equivalent channel 


Rob lt) = ave janf "4 O(t = Ta). (5.3) 


See Example 3.37 and Example 3.39 to review this calculation. The Bsinc(t) term is present because 
the complex baseband equivalent is a baseband bandlimited signal. In the frequency domain 


h(f) = rect(f/B)ae tle" ea raf 
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we observe that |H(f)| is a constant for f © [—B/2, B/2]. This channel is said to be frequency flat 
because it is constant over the bandwidth of interest to the signal. Channels that consist of multipaths 
can be approximated as frequency flat if the signal bandwidth is much less than the coherence 
bandwidth, which is discussed further in Section 5.8. 

Now we incorporate this channel into our received signal model. The channel in (5.2) convolves 
with the transmitted signal prior to the noise being added at the receiver. As a result, the complex 
baseband received signal prior to matched filtering and sampling is 


fe] = /E,h(t) * Gtx(t) * y. s|m]é(t — mT) + v(t) (5.5) 
= \/E,ae 27 Fer S- s|m]gtx(t — mT — Tq) + v(t). (5.6) 


To ensure that 7(¢) is bandlimited, henceforth we suppose that the noise has been lowpass filtered to 
have a bandwidth of B/2. For simplicity of notation, and to be consistent with the discrete-time 


representation, we let i = ae? FT \/E and write 
2 4) 
rf}=A >. s|m]gtx(t — mT — Tq) + v(t). (5.7) 
m=— Co 


The factor of V £x is included inh since only the combined scaling is important from the perspective 
of receiver design. Matched filtering and sampling at the symbol rate give the received signal 


oe 


y(n] =h y. s|m|g((n — m)T — Ta) + v[n] (5.8) 


m= — oo 


where g(t) is a Nyquist pulse shape. Compared with the AWGN received signal model y[”] = s[n] + 
v[n], there are several sources of distortion, which must be recognized and corrected. 


One impairment is caused by symbol timing error. Suppose that ty is a fraction of a symbol period, 
that is, t7 © [0, 7). This models the effect of sample timing error, which happens when the receiver 
does not sample at precisely the right point in time. Under this assumption 

y(n] = hs|n|g(ta) +h >. s|{m]g((n — m)T — Ta) + vn] . (5. 
Neca eco a 


m=~n 
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Intersymbol interference is created when the Nyquist pulse shape is not sampled exactly at nT, since 
g(nT + Tg) 1s not generally equal to d[n]. Correcting for this fractional delay requires symbol 


synchronization, equalization, or a more complicated detector. 
A second impairment occurs with larger delays. For illustration purposes, suppose that ty = dT for 


some integer d. This is the case where symbol timing has been corrected but an unknown propagation 
delay, which is a multiple of the symbol period, remains. Under this assumption 


y(n] = h > s|mlg((n — m)T — Tg) + v[n] (5.10) 
re 

= fi > s|mlg((n — m)T — dT) + vIn] (5.11) 

= hs{n — d] + v[n]. (5.12) 


Essentially integer offsets create a mismatch between the indices of the transmitted and received 
symbols. Frame synchronization is required to correct this frame timing error impairment. 


Finally, suppose that the unknown delay ty has been completely removed so that ty = 0. Then the 
received signal is 


y[n| = hs[n] + v[n]. (5.13) 


Sampling and removing delay leaves distortion due to the attenuation and phase shift in 4. Dealing 
with / requires either a specially designed modulation scheme, like DQPSK (differential quadrature 
phase-shift keying), or channel estimation and equalization. 


It is clear that amplitude, phase, and delay if not compensated can have a drastic impact on system 
performance. As a consequence, every wireless system is designed with special features to enable 
these impairments to be estimated or directly removed in the receiver processing. Most of this 
processing is performed on small segments of data called bursts, packets, or frames. We emphasize 
this kind of batch processing in this book. Many of the algorithms have adaptive extensions that can 
be applied to continually estimate impairments. 


5.1.2 Symbol Synchronization 

The purpose of symbol synchronization, or timing recovery, is to estimate and remove the fractional 
part of the unknown delay ty, which corresponds to the part of the error in [0, 7). The theory behind 
these algorithms is extensive and their history is long [309, 226, 124]. The purpose of this section 1s 
to present one of the many algorithm approaches for symbol synchronization in complex pulse- 
amplitude modulated systems. 

Philosophically, there are several approaches to synchronization as illustrated in Figure 5.1. There 
is a pure analog approach, a combined digital and analog approach where digital processing is used 
to correct the analog, and a pure digital approach. Following the DSP methodology, this book 
considers only the case of pure digital symbol synchronization. 
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Figure 5.1 Different options for correcting symbol timing. The bottom approach relies only on 
DSP. 


We consider two different strategies for digital symbol synchronization, depending on whether the 
continuous-to-discrete converter can be run at a low sampling rate or a high sampling rate: 


¢ The oversampling method, illustrated in Figure 5.2, 1s suitable when the oversampling factor 
M,,, is large and therefore a high rate of oversampling is possible. In this case the 


synchronization algorithm essentially chooses the best multiple of 7/M,, and adds a suitable 
integer delay k* prior to downsampling. 
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Figure 5.2 Oversampling method suitable when M,, is large 


¢ The resampling method is illustrated in Figure 5.3. In this case a resampler, or interpolator, is 
used to effectively create an oversampled signal with an effective sample period of 7/M,, even 


if the actual sampling is done with a period less than 7/M,, but still satisfying Nyquist. Then, as 
with the oversampling method, the multiple of 7/M,,, is estimated and a suitable integer delay k* 
is added prior to the downsampling operation. 
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Figure 5.3 Resampling or interpolation method suitable when /,, is small, for example, N = 2 


A proper theoretical approach for solving for the best of ty would be to use estimation theory. For 


example, it is possible to solve for the maximum likelihood estimator. For simplicity, this section 
considers an approach based on a cost function known as the maximum output energy (MOE) 
criterion, the same approach as in [169]. There are other approaches based on maximum likelihood 
estimation such as the early-late gate approach. Lower-complexity implementations of what is 
described are possible that exploit filter banks; see, for example, [138]. 


First we compute the energy output as a way to justify each approach for timing synchronization. 
Denote the continuous-time output of the matched filter as 


y(t) =h >. s|m]g(t — mT — Tq) + v(t). (5.14) 


m=— 0 


The output energy after sampling by n7T'+ T is 


Jmon(t) = E ly (n + r)/?| (5.15) 
ses - 
= |h|? >. lg(mT +7 — 7a) |* + No. (5.16) 


m=— co 


Suppose that ty = dT + t,.,, and 7 = dT + Ttrac; then 


‘° ¢ 
" ly (nT = T) )] = i |A|? > lg (mr + dT + Tfrac — dT — fica) F i No (5.17) 
m=—oo 


= | |? S- lg (m fi Sib Tfrac Np Trac) |? + No (5. l 8) 
m=—co 
with a change of variables. Therefore, while the delay t can take an arbitrary positive value, only the 
offset that corresponds to the fractional delay has an impact on the output energy. 
The maximum output energy approach to symbol timing attempts to find the t that maximizes 
JMor(t) where t © [0, 7]. The maximum output energy solution is 


Tq = arg max JMor(T). (5.19) 
ré(0,T) 


The rationale behind this approach is that 


E ||y (nT +7) P| < |hl2I9(0)|2 + No (5.20) 


where the inequality is true for most common pulse-shaping functions (but not for arbitrarily chosen 
pulse shapes). Thus the unique maximum of Jy;9,(t) corresponds to T= Ty. The values of Jyyop(t) are 
plotted in Figure 5.4 for the raised cosine pulse shape. A proof of (5.20) for sinc and raised cosine 
pulse shapes is established in Example 5.1 and Example 5.2. 
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Figure 5.4 Plot of Jyog(t) for t © [-7/2, T/2], in the absence of noise, assuming a raised cosine 
pulse shape (4.73). The x-axis is normalized to T. As the rolloff value of the raised cosine is 
increased, the output energy peaks more, indicating that the excess bandwidth provided by 
choosing larger values of rolloff a translates into better symbol timing using the maximum output 
energy cost function. 


Example 5.1 
Show that the inequality in (5.20) is true for sinc pulse shapes. 


Answer: For convenience denote 


mI + T —T, ; 
sinc (a) = sinc(m + a) (5.21) 


where “ = T * Let g(t) = sinc(t+a) and g[m] = g(m7) = sinc(m+a), and let g(f) and g(e”/) be the 
CTFT of g(f) and DTFT of g[m] respectively. Then g(f) = e?"% rect(f) and 


i _ fon 
J2rf) — — res 5.22 
aem™)=7 Y x(5-4) (5.22) 
fe @) 
= pS e274 F— rect(f — n). (5.23) 
n=—co 


By Parseval’s theorem for the DTFT we have 


0° 1/2 . 
D> |sinc(m+a)?= f  |g(e?*)Pay (5.24) 
m=—oo —1/2 
2| oO , 
=i pa e?7F—)rect(f —n)| df (5.25) 
—1/2 m=— 00 
1/2 © 7 2 
< | >. eral" rect( f — n) df (5.26) 
—1/2,, n=—oo 
1/2 
=f 5 | rect(f —n))?df (5.27) 
1/2 n=— oO 
1/2 
= ldf (5.28) 
~1/2 
aa (5.29) 
9(0)| (5.30) 


where the inequality follows because |}'a,| < ¥ |a,|. Therefore, 


oo 


S— |g(mP +7 — ta) |? < |9(0)/?. (5.31) 


m=— oo 


Example 5.2 
Show that the MOE inequality in (5.20) holds for raised cosine pulse shapes. 


Answer: The raised cosine pulse 


cos(7at) 


alt) = asine(t/T a 5.32 
Gre(t) = sine(t/ )j — 40242 /T? (5.32) 
is a modulated version of the sinc pulse shape. Since 
cos(7at) 
————— | <1 5.33 
1 — 4a2t?/T?2| — (5.33) 
it follows that 
lgre(t)|? < |sine(t/T)|? (5.34) 


and the result from Example 5.1 can be used. 


Now we develop the direct solution to maximizing the output energy, assuming the receiver 
architecture with oversampling as in Figure 4.10. Let r[n] denote the signal after oversampling or 
resampling, assuming there are M/, samples per symbol period. Let the output of the matched receiver 


filter prior to downsampling at the symbol rate be 
oo 


y(n] = a r[m]grx{n — m]. (S35) 


m=-— oo 


We use this sampled signal to compute a discrete-time version of Jyjop(T) given by 
Jmog,a[k] = E ofr Mrs + RI? (5.36) 


where k is the sample offset between 0, 1,..., M4,, — 1 corresponding to an estimate of the fractional 
part of the timing offset given by k7/M,,. To develop a practical algorithm, we replace the 
expectation with a time average over P symbols, thanks to ergodicity, so that 
1, P= 
Imon.elk] = 5 S> IrlpMrx + kl”. (5.37) 


p=0 


Looking for the maximizer of Jyop lk] over k=0,1,..., MM, — 1 gives the optimum sample 4* and 
an estimate of the symbol timing offset k*7/M..,. 
The optimum correction involves advancing the received signal by k* samples prior to 


downsampling. Essentially, the synchronized data is Y In] = y[n Myx + k*], Equivalently, the signal 
can be delayed by k* — M,,. samples, since subsequent signal processing steps will in any case 
correct for frame synchronization. 


The main parameter to be selected in the symbol timing algorithms covered in this section is the 
oversampling factor M,. This decision can be based on the residual ISI created because the symbol 


timing is quantized. Using the SINR from (4.56), assuming / is known perfectly, matched filtering at 
the receiver, and a maximum symbol timing offset 7/2M,,, then 


T_\|? 
g (si) 


5 : 
g (mr a a) | + No 


||? 
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This value can be used, for example, with the probability of symbol error analysis in Section 4.4.5 to 
select a value of M,, such that the impact of symbol timing error is acceptable, depending on the SNR 


and target probability of symbol error. An illustration is provided in Figure 5.5 for the case of 4- 
QAM. In this case, a value of M,, = 8 leads to less than 1dB of loss at a symbol error rate of 10. 
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Figure 5.5 Plot of the exact symbol error rate for 4-QAM from (4.147), substituting SINR for 
SNR, with h| = VEx for different values of M,,, 1n (5.38), assuming a raised cosine pulse shape 


with a = 0.25. With enough oversampling, the effects of timing error are small. 


5.1.3 Frame Synchronization 


The purpose of frame synchronization is to resolve multiple symbol period delays, assuming that 
symbol synchronization has already been performed. Let d denote the remaining offset where d = t4/T 
— k*/M, 


me assuming the symbol timing offset was corrected perfectly. Given that, [SI is eliminated and 


y[n] = hs[n — d] + v[n]. (5.39) 


To reconstruct the transmitted bit sequence it is necessary to know “where the symbol stream starts.” 
As with symbol synchronization, there is a great deal of theory surrounding frame synchronization 
[343, 224, 27]. This section considers one common algorithm for frame synchronization in flat 
channels that exploits the presence of a known training sequence, inserted during a training phase. 


Most wireless systems insert reference signals into the transmitted waveform, which are known by 
the receiver. Known information is often called a training sequence or a pilot symbol, depending on 
how the known information is inserted. For example, a training sequence may be inserted at the 
beginning of a transmission as illustrated in Figure 5.6, or a few pilot symbols may be inserted 
periodically. Most systems use some combination of the two where long training sequences are 
inserted periodically and shorter training sequences (or pilot symbols) are inserted more frequently. 
For the purpose of explanation, it is assumed that the desired frame begins at discrete time n = 0. The 
total frame length is N,,,, including a length N,, training phase and an N,,, — NV, data phase. Suppose 


N, re l . . . . 
that 141") }n“o is the training sequence known at the receiver. 


Ssyne[?] 


Training phase Data phase 
Figure 5.6 A frame structure that consists of periodically inserted training and data 


One approach to performing frame synchronization is to correlate the received signal with the 
training sequence to compute 


Wi] 
Rin] = >; t*(k]y[n + ki] (5.40) 
k=0 
and then to find 
d = arg max |R[n]|. (5.41) 


The maximization usually occurs by evaluating R[”] over a finite set of possible values. For example, 
the analog hardware may have a carrier sense feature that can determine when there is a significant 
signal of interest. Then the digital hardware can start evaluating the correlation and looking for the 
peak. A threshold can also be used to select the starting point, that is, finding the first value of n such 
that |R[7]| exceeds a target threshold. An example with frame synchronization is provided in Example 
=a, 


Example 5.3 


Consider a system as described by (5.39) with h = 0.5e!"/°./E,, , N= 7, and N,4 = 21 withd = 0. 
We assume 4-QAM for the data transmission and that the training consists of the Barker code of length 
7 given by t4["] fn=o = {1,1,1,—1,—1,1,—1}. The SNR is 5dB. We consider a frame snippet 
that consists of 14 data symbols, 7 training symbols, 14 data symbols, 7 training symbols, and 14 data 
symbols. In Figure 5.7, we plot |R[]| computed from (5.40). There are two peaks that correspond to 
locations of our training data. The peaks happen at 21 and 42 as expected. If the snippet was delayed, 
then the peaks would shift accordingly. 
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Figure 5.7 The absolute value of the output of a correlator for frame synchronization. The details 
of the simulation are provided in Example 5.3. Two correlation peaks are seen, corresponding to 
the location of the two training sequences. 


A block diagram of a receiver including both symbol synchronization and frame synchronization 
operations is illustrated in Figure 5.8. The frame synchronization happens after the downsampling and 


prior to the symbol detection. Fixing the frame synchronization requires advancing the signal by d 
symbols. 
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Figure 5.8 Receiver with symbol synchronization based on oversampling and frame 
synchronization 


The frame synchronization algorithm may find a false peak if the data is the same as the training 
sequence. There are several approaches to avoid this problem. First, training sequences can be 
selected that have good correlation properties. There are many kinds of sequences known in the 
literature that have good autocorrelation properties [116, 250, 292], or periodic autocorrelation 
properties [70, 267]. Second, a longer training sequence may be used to reduce the likelihood of a 
false positive. Third, the training sequence can come from a different constellation than the data. This 
was used in Example 5.3 where a BPSK training sequence was used but the data was encoded with 
4-QAM. Fourth, the frame synchronization can average over multiple training periods. Suppose that 
the training is inserted every N,,; symbols. Averaging over P periods then leads to an estimate 


a 
d= _—s> 2 t*|k]y[n + k + pNtot]| - (5.42) 


p=0 | k=0 


Larger amounts of averaging improve performance at the expense of higher complexity and more 
storage requirements. Finally, complementary training sequences can be used. In this case a pair of 
training sequences {t,[m]} and {t,[n]} are designed such that 

Ner—1 px fp), -) 4 4*[- LE+N, 

ro tilkly[n +k] + to[k]y[n + k + Meot] has a sharp correlation peak. Such sequences are 


discussed further in Section 5.3.1. 


5.1.4 Channel Estimation 


Once frame synchronization and symbol synchronization are completed, a good model for the 
received signal is 


y(n] = hs[n] + v[n]. (5.43) 


The two remaining impairments are the unknown flat channel / and the AWGN v[n]. Because h 
rotates and scales the constellation, the channel must be estimated and either incorporated into the 
detection process or removed via equalization. 


The area of channel estimation is rich and the history long [40, 196, 369]. In general, a channel 
estimation problem is handled like any other estimation problem. The formal approach 1s to derive an 
optimal estimator under assumptions about the signal and noise. Examples include the least squares 
estimator, the maximum likelihood estimator, and the MMSE estimator; background on these 
estimators may be found in Section 3.5. 


In this section we emphasize the use of least squares estimation, which is also the ML estimator 
when used for linear parameter estimation in Gaussian noise. To use least squares, we build a 
received signal model from (5.43) by exploiting the presence of the known training sequence from n = 


0,1,...,,,— 1. Stacking the observations in (5.43) into vectors, 
v0 (0 v0 
cs cee — 
yl Ne — 1] tLNee — 1] ol Ne —1] 


which becomes compactly 
y=th+v. (5.45) 


We already computed the maximum likelihood estimator for a more general version of (5.46) in 
Section 3.5.3. The solution was the least squares estimate, given by 


h = (t*t)!t*y (5.46) 
Dnao t*[nlylnl (5.47) 
— a ra of i 
pao t [n]t[n) 


Essentially, the least squares estimator correlates the observed data with the training data and 
normalizes the result. The denominator is just the energy in the training sequence, which can be 
precomputed and stored offline. The numerator is calculated as part of the frame synchronization 
process. Therefore, frame synchronization and channel estimation can be performed jointly. The 
receiver, including symbol synchronization, frame synchronization, and channel estimation, is 
illustrated in Figure 5.9. 
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Figure 5.9 Receiver with symbol synchronization based on oversampling, frame synchronization, 
and channel estimation 


Example 5.4 

In this example, we evaluate the squared error in the channel estimate. We consider a similar system 
to the one described in Example 5.3 with a length N,,= 7 Barker code used for a training sequence, 
and a least squares channel estimator as in (5.46). We perform a Monte Carlo estimate of the channel 


by generating a noise realization and estimating the channel h\n) for n= 1,2,..., 1000 realizations. 

> 2 
In Figure 5.10, we evaluate the estimation error as a function of the SNR, which is defined as |A|*/oy 
, and is 5dB in this example. We plot the error for one realization of a channel estimation, which is 


Tr 1 999 Tr..142 
given by |h—h{0]| : and the mean squared error, which is given by 1000 din=o |h—hln . The plot 


shows how the estimation error, based both on one realization and on average, decreases with SNR. 
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Figure 5.10 Estimation error as a function of SNR for the system in Example 5.4. The error 
estimate reduces as SNR increases. 


Example 5.5 


In this example, we evaluate the squared error in the channel estimate for different lengths of 4-QAM 
training sequences, an SNR of 5dB, the channel as in Example 5.3, and a least squares channel 
estimator as in (5.46). We perform a Monte Carlo estimate of the squared error of one realization and 
the mean squared error, as described in Example 5.4. We plot the results in Figure 5.11, which show 
how longer training sequences reduce estimation error. Effectively, longer training increases the 
effective SNR through the addition of energy through coherent combining in z phe 0 t in|t|n] and 


Niger 4. 
more averaging in dno e(njr[n], 
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Figure 5.11 Estimation error as a function of the training length for the system in Example 5.5. The 
error estimate reduces as the training length increases. 


5.1.5 Equalization 


Assuming that channel estimation has been completed, the next step in the receiver processing is to 
use the channel estimate to perform symbol detection. There are two reasonable approaches; both 


involve assuming that /2 is the true channel estimate. This is a reasonable assumption if the channel 
estimation error is small enough. 


The first approach is to incorporate h into the detection process. Replacing VEX by / in (4.105), 
then 


s|n] = arg min |y[n] — hs|?. (5.48) 
sEC 
Consequently, the channel estimate becomes an input into the detector. It can be used as in (5.48) to 


scale the symbols during the computation of the norm, or it can be used to create a new constellation 


C = thei, he2,..., hem} and detection performed using the scaled constellation as 


8[n] = arg min |y[n] — s|?. (5.49) 
sEC 


The latter approach is useful when N,,,; is large. 
An alternative approach to incorporating the channel into the ML detector is to remove the effects 


of the channel prior to detection. For h# 0. 


2 
n y|n . 
arg min uin |y[7] —hs|? = arg min ln\~? 1 | vin) —s = arg min | lr] — 8 . (5.50) 
s€C h sec | h | 
The process of creating the signal / In] /h is an example of equalization. When equalization is used, 


the effects of the channel are removed from y[n] and a standard detector can be applied to the result, 
leveraging constellation symmetries to reduce complexity. The receiver, including symbol 
synchronization, frame synchronization, channel estimation, and equalization, 1s illustrated in Figure 
a. 


The probability of symbol error with channel estimation can be computed by treating the estimation 
error as noise. Let h=h+ Ne where /t — h= hei is the estimation error. For a given channel / and 
estimate /h the equalized received signal is 


s[n] = -(h + he)s{n] + Sail (5.51) 


~~ 


= s(n] + Pe a ~ Eel (5.52) 
h h 


h\2 
It is common to treat the middle interference term as additional noise. Moving the common hI to the 
numerator, 
i |h|? 
Sake = = ——> (5.53) 
le 9 9 / 
lhe|* + 0% 
This can be used as part of a Monte Carlo simulation to determine the impact of channel estimation. 
Since the receiver does not actually know the estimation error, it is also common to consider a 


= _ _E | he | . 

variation of the SINR expression where the variance of the estimate Mel is used in place of the 

instantaneous value (assuming that the estimator is unbiased so zero mean). Then the SINR becomes 
(a? 

E | |el?] + 0 

This expression can be used to study the impact of estimation error on the probability of error, as the 


mean squared error of the estimate is a commonly computed quantity. A comparison of the probability 
of symbol error for these approaches is provided in Example 5.6. 


SINR = (5.54) 


Example 5.6 

In this example we evaluate the impact of channel estimation error. We consider a system similar to 
the one described in Example 5.4 with a length N,,= 7 Barker code used for a training sequence, and 
a least squares channel estimator as in (5.46). We perform a Monte Carlo estimate of the channel by 
generating a noise realization and estimating the channel h [7 r| forn=1,2,..., 1000 realizations. For 
each realization, we compute the error, insert into the SINR h. in (5.53), and use that to compute the 
exact probability of symbol error from Section 4.4.5. We then average this over 1000 Monte Carlo 
simulations. We also compare in Figure 5.12 with the probability of error assuming no estimation 
error with SNR |; PE,/N, and with the probability of error computed using the average error from the 
SINR in (5.54). We see in this example that the loss due to estimation error is about 1dB and that 


there is little difference in the average probability of symbol error and the probability of symbol error 
computed using the average estimation error. 
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Figure 5.12 The probability of symbol error with 4-QAM and channel estimation using a length N,, 


= 7 training sequence. The probability of symbol error with no channel estimation error is 
compared with the average of the probability of symbol error including the instantaneous error, and 
the probability of symbol error using the average estimation error. There is little loss in using the 
average estimation error. 


5.1.6 Carrier Frequency Offset Synchronization 


Wireless communication systems use passband communication signals. They can be created at the 
transmitter by upconverting a complex baseband signal to carrier f, and can be processed at the 


receiver to produce a complex baseband signal by downconverting froma carrier f,. In Section 3.3, 
the process of upconversion to carrier f,, downconversion to baseband, and the complex equivalent 
notation were explained under the important assumption that f, 1s known perfectly at both the 
transmitter and the receiver. In practice, f, is generated from a local oscillator. Because of 
temperature variations and the fact that the carrier frequency is generated by a different local 
oscillator and transmitter and receiver, in practice f, at the transmitter is not equal to fe at the 
receiver as illustrated in Figure 5.13. The difference between the transmitter and the receiver 


fe = fo— fe is the carrier frequency offset (or simply frequency offset) and is generally measured in 
hertz. In device specification sheets, the offset is often measured as |f,|/f, and is given in units of parts 


per million. In this section, we derive the system model for carrier frequency offset and present a 
simple algorithm for carrier frequency offset estimation and correction. 


cos(21f.t) 


Figure 5.13 Abstract block diagram of a wireless system with different carrier frequencies at the 
transmitter and the receiver 
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Let 
p(t) = Re [x(t)e?*4«*) (5.55) 


be the passband signal generated at the transmitter. Let the observed passband signal at the receiver at 
carrier f, be 


rp(t) = | h.(t — T)xp(7)dr (5.56) 
= r;(t) cos(27f,t) — rq(t) sin(27 fet) (5.57) 

age oes ee . 

=- (r(t)el?* Fe +r*(t)e j2n fi “4 (5.58) 


~~ 


t 


where r(¢) = r(t) + jr,(t) 1s the complex baseband signal that corresponds to 7,(¢) and we use the fact 


r 
p 
a | : et 
that Re(z) = 9(@ + ©") for the last step. Now suppose that r,(7) is downconverted using carrier Ie 
to produce a new signal r'(t). Then the extracted complex baseband signal is (ignoring noise) 


r'(t) = 2Bsine(tB) « (rp(t)ePrF*) (5.59) 


Focusing on the last term in (5.59) with fe = fe - fe then 


rp(t)e Hse? = 1, (t)eJ2"Ue— fo)! (5.60) 
= Glan loty (pense, (5.61) 
Substituting in 7,(¢) from (5.58), 
ee ae oe 
rp(tje 127 fet = sie +r (t)e)?* fete jan fet (5.62) 


Lowpass filtering (assuming, strictly speaking, a bandwidth of B + |f,|) and correcting for the factor of 
1 
2 leaves the complex baseband equivalent 

r'(t) — el2t ety (t), (5.63) 


Carrier frequency offset results in a rolling phase shift that happens after the convolution and depends 
on the difference in carrier f,. As the phase shift accumulates, failing to synchronize can quickly lead 
to errors. 

Following the methodology of this book, it is of interest to formulate and solve the frequency offset 
estimation and correction problem purely in discrete time. To that end a discrete-time complex 
baseband equivalent model is required. 

To formulate a discrete-time model, we focus on the sampled signal after matched filtering, still 
neglecting noise, as 


y(t) = fre — T)Grx(T dr (5.64) 
= f ePehOn(t — r)gee(t)ar (5.65) 
= elintet | r(t — r)e PFT g, (7) dr. (5.66) 


Suppose that the frequency offset f, is sufficiently small that the variation over the duration of g,,(7) 
can be assumed to be a constant; then e ?¥etg. (t) = g,,(t). This is reasonable since most of the 
energy in the matched filter g,,(¢) is typically concentrated over a few symbol periods. Assuming this 
holds, 


y(t) = aad | r(t — T)grx(T)dr. (5.67) 


As a result, it is reasonable to model the effects of frequency offset as if they occurred on the matched 
filtered signal. 


Now we specialize to the case of flat-fading channels. Substituting for 7(f), adding noise, and 
sampling gives 
y[n] = e)?™"h >. s|m|g((n — m)T — Tq) + v[n] (5.68) 


m= — OO 
where ¢ = f,7 is the normalized frequency offset. Frequency offset introduces a multiplication by the 


discrete-time complex exponential e?*fe™n, 


To visualize the effects of frequency offset, suppose that symbol and frame synchronization has 
been accomplished and only equalization and channel estimation remain. Then the Nyquist property of 
the pulse shape can be exploited so that 


y(n] = e)?2—Nh sin] + vuln]. (5.69) 


Notice that the transmit symbol is being rotated by exp(j27¢). As n increases, the offset increases, 
and thus the symbol constellation rotates even further. The impact of this is an increase in the number 
of symbol errors as the symbols rotate out of their respective Voronoi regions. The effect is illustrated 


in Example 5.7. 


Example 5.7 


Consider a system with the frequency offset described by (5.69). Suppose that ¢ = 0.05. In Figure 
5.14, we plot a 4-QAM constellation at times n = 0, 1, 2, 3. The Voronoi regions corresponding to the 
unrotated constellation are shown on each plot as well. To make seeing the effect of the rotation 
easier, one point is marked with an x. Notice how the rotations are cumulative and eventually the 
constellation points completely leave their Voronoi regions, which results in detection errors even in 
the absence of noise. 


N= 1 N=2 N=3 

Figure 5.14 The successive rotation of a 4-QAM constellation due to frequency offset. The 

Voronoi regions for each constellation point are indicated with the dashed lines. To make the 
effects of the rotation clearer, one of the constellation points is marked with an x. 


Correcting for frequency offset is simple: just multiply y[n] by e?"€". Unfortunately, the offset is 
unknown at the receiver. The process of correcting for ¢ is known as frequency offset synchronization. 
The typical method for frequency offset synchronization involves first estimating the offset €, then 


correcting for it by forming the new sequence &XP( —j2men)y|n] with the phase removed. There are 
several different methods for correction; most employ a frequency offset estimator followed by a 
correction phase. Blind offset estimators use some general properties of the received signal to 
estimate the offset, whereas non-blind estimators use more specific properties of the training 


sequence. 

Now we review two algorithms for correcting the frequency offset in a flat-fading channel. We 
start by observing that frequency offset does not impact symbol synchronization, since the ee” 
cancels out the maximum output energy maximization caused by the magnitude function (see, for 
example, (5.15)). As a result, ISI cancels and a good model for the received signal is 


y(n] = el27erh gin -- d| + v[n] (5.70) 


where the offset ¢, the channel h, and the frame offset d are unknowns. In Example 5.8, we present an 
estimator based on a specific training sequence design. In Example 5.9, we use properties of 4-QAM 
to develop a blind frequency offset estimator. We compare their performance to highlight differences 
in the proposed approaches in Figure 5.15. We also explain how to jointly estimate the delay and 
channel with each approach. These algorithms illustrate some ways that known information or signal 


structure can be exploited for estimation. 
© coherent approach 
X_ blind approach 
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mean squared error 


10° 10° 
total number of samples 
Figure 5.15 The mean squared error performance of two different frequency offset estimators: the 
training-based approach in Example 5.8 and the blind approach in Example 5.9. Frame 
synchronization 1s assumed to have been performed already for the coherent approach. The channel 
is the same as in Example 5.3 and other examples, and the SNR is 5dB. The true frequency offset is 


¢ = 0.01. The estimators are compared assuming the number of samples (N,, for the coherent case 
and N,,; for the blind case). The error in each Monte Carlo simulation is computed from phase 


(6°) t5 avoid any phase wrapping effects. 


Example 5.8 


In this example, we present a coherent frequency offset estimator that exploits training data. We 
suppose that frame synchronization has been solved. Let us choose as a training sequence ¢[n] = 
exp(2zf,n) forn =0,1,..., N,,— 1 where f, is a fixed frequency. This kind of structure is available 


from the Frequency Correction Burst in GSM, for example [100]. Then for =0,1,...,N,—1, 
y|n] = el2ren jam fin py 4 u|n]. (5.71) 


Correcting the known offset introduced by the training data gives 


e Jenhe "yint| = gltren yp, 4 platen vin}. (5.72) 
The rotation by e 7" does not affect the distribution of the noise. Solving for the unknown frequency 
in (5.72) is a classic problem in signal processing and single-tone parameter estimation, and there are 


model in [330]. Approximate the corrected signal in (5.72) as 


er Fery in] wy |hlel2mentotvln] (5.73) 


< 


where 9 1s the phase of / and v[m] is Gaussian noise. Then, looking at the phase difference between 
two adjacent samples, a linear system can be written from the approximate model as 


phase(e!?"/"y* [nje 127+ Dain 4 1]) = Qre + vin +1] — v{[n]. (5.74) 
Aggregating the observations in (5.79) fromn=1,...,N,.— 1, we can create a linear estimation 
problem 
p=27el+v (5.75) 


where [p],, = phase(e?""y*[nJe PC) yn + 1]), Lis an N,,— 1 x 1 vector, and vis anN,,- 1 x 1 
noise vector. The least squares solution is given by 


ee 5 , 
é=5-(p*p) ‘1*p (5.76) 
27 
Nui 
= — : h phase(e!?*/* y* [n]y|[n + 1), (5.77) 
27 (Mer 1) ; 


n=] 


which is also the maximum likelihood solution if v[7] is assumed to be IID [174]. 


Frame synchronization and channel estimation can be incorporated into this estimator as follows. 
Given that the frequency offset estimator in (5.77) solves a least squares problem, there is a 
corresponding expression for the squared error; see, for example, (3.426). Evaluate this expression 
for many possible delays and choose the delay that has the lowest squared error. Correct for the offset 
and delay, then estimate the channel as in Section 5.1.4. 


Example 5.9 


In this example, we exploit symmetry in the 4-QAM constellation to develop a blind frequency offset 
estimator, which does not require training data. The main observation is that for 4-QAM, the 
normalized constellation symbols are points on the unit circle. In particular for 4-QAM, it turns out 
that s4[n] =—1. Taking the fourth power, 


y*[n] _ _ el2medn p 4 ite y(n), (5.78) 


where v[m] contains noise and products of the signal and noise. The unknown parameter d disappears 
because s*[n — d] =—1, assuming a continuous stream of symbols. The resulting equation has the form 
of a complex sinusoid in noise, with unknown frequency, amplitude, and phase similar to (5.72). Then 
a set of linear equations can be written from 


phase(y*[n + 1]y**[n]) = 82 + D[n] (5.79) 


ina similar fashion to (5.75), and then 


l Ntot —] 


é= 7 >. phase(y*[n + 1]y**{n)). (5.80) 


87r( Ntot =. 


n=1 


Frame synchronization and channel estimation can be incorporated as follows. Use all the data to 
estimate the carrier frequency offset from (5.80). Then correct for the offset and perform frame 
synchronization and channel estimation based on training data, as outlined in Section 5.1.3 and 
Section 5.1.4. 


5.2 Equalization of Frequency-Selective Channels 


In this and subsequent sections, we generalize the development in Section 5.1 to frequency-selective 
fading channels. We focus specifically on equalization, assuming that channel estimation, frame 
synchronization, and frequency offset synchronization have been performed. We solve the channel 
estimation problem in Section 5.3 and the frame and frequency offset synchronization problems in 
Section 5.4. First we develop the discrete-time received signal model, including a frequency- 
selective channel and AWGN. Then we develop three approaches for linear equalization. The first 
approach is based on constructing an FIR filter that approximately inverts the effective channel. The 
second approach is to insert a special prefix in the transmitted signal to permit frequency-domain 
equalization at the receiver, in what is called SC-FDE. The third approach also uses a cyclic prefix 
but precodes the information in the frequency domain, in what is called OFDM modulation. As the 
equalization removes intersymbol interference, standard symbol detection follows the equalization 
operations. 


5.2.1 Discrete-Time Model for Frequency-Selective Fading 

In this section, we develop a received signal model for general frequency-selective channels, 
generalizing the results for a single path in Section 5.1.1. Assuming perfect synchronization, the 
received complex baseband signal after matched filtering but prior to sampling is 


y(t) — Orx(t )* hit * 4/ Fy ; y: s|m|gex(t — mT) + Orx(é) * u(t) (5.81) 


m=— OO 


we 


Ss" s|m| heg(t — mT) + grx(t) * v(t). (5.82) 


m= — 00 


Essentially, y(t) takes the form of a eB pulse- ene modulated signal but where g(t) is 


replaced by Nest (t) = Grx(t) * Gex(t) * VExh(t ) Except in special cases, this new effective pulse 
is no longer a Nyquist pulse shape. 


We now develop a sampled signal model. Let 
hin] = hep (nT’) (5.83) 
denote the sampled effective discrete-time channel. This channel combines the complex baseband 
equivalent model of the propagation channel, the transmit pulse-shaping filter, the receive pulse 


matched transmit filter, and the scaled transmit energy V Ey. 


Sampling (5.82) at the symbol rate and with the effective discrete-time channel gives the received 
signal 


= 5° 9 h[é|s[n — €] + v[n]. (5.84) 
f=—o00 


The main distortion is ISI, since every observation y[] is a linear combination of all the transmitted 
symbols through the convolution integral. 


Example 5.10 
To get some insight into the impact of intersymbol interference, suppose that 


h[n] = VE, 6[n] + V Exh 6[n — 1] Then 
y(n] = V Exs[n] + V Exhis[n — 1] + v[n]. (5.85) 
The n" symbol s[n] is subject to interference from s[n — 1], sent in the previous symbol period. 
Without correcting for this interference, detection performance will be bad. Treating ISI as noise, 
Ex 


SINR = : =, 5.86 
Ex\hy be 7 No ( ) 


For example, for SNR = 10dB = 10 and |4,/? = 1, SINR = 10/(10+1) = 0.91 or —0.4dB. Figure 5.16 
shows the SINR as a function of |, for SNR = 10dB and SNR = 5dB. 
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Figure 5.16 The SINR as a function of |1,|? for the discrete-time channel 
h[n] = VExd[n] + VExhid[n — lin Example 5.10. 


To develop receiver signal processing algorithms, it is reasonable to treat i[n] as causal and FIR. 
The causal assumption is reasonable because the propagation channel cannot predict the future. 
Furthermore, frame synchronization algorithms attempt to align, assuming a causal impulse response. 
The FIR assumption is reasonable because (a) there are no perfectly reflecting environments and (b) 
the signal energy decays as a function of distance between the transmitter and the receiver. 


Essentially, every time the signal reflects, some of the energy passes through the reflector or is 
scattered and thus it loses energy. As the signal propagates, it also loses power as it spreads in the 
environment. Multipaths that are weak will fall below the noise threshold. With the FIR assumption, 


L 
y[n| = >— h[Qs[n — 4 + v[n]. (5.87) 
£=0 


The channel is fully specified by the L+1 coefficients {hlé]} £0, The order of the channel, given by L, 
determines to a large extent the severity of the ISI. Flat fading corresponds to the special case of L = 
0. We develop equalizers specifically for the system model in (5.87), assuming that the channel 
coefficients are perfectly known at the receiver. 


5.2.2 Linear Equalizers in the Time Domain 

In this section, we develop an FIR equalizer to remove (approximately) the effects of ISI. We suppose 
that the channel coefficients are known perfectly; they can be estimated using training data via the 
method described in Section 5.3.1. 

There are many strategies for equalization. One of the best approaches is to apply what is known as 
maximum likelihood sequence detection [348]. This is a generalization of the AWGN detection rule, 
which incorporates the memory in the channel due to L > 0. Unfortunately, this detector 1s complex to 
implement for channels with large Z, though note that it is implemented in practical systems for 
modest values of L. Another approach is decision feedback equalization, where the contributions of 
detected symbols are subtracted, reducing the extent of the ISI [17, 30]. Combinations of these 
approaches are also possible. 


In this section, we develop an FIR linear equalizer that operates on the time-domain signal. The 
pe 
goal of a linear equalizer is to find a filter that removes the effects of the channel. Let {fl tee 0 be an 
FIR equalizer. The length of the equalizer is given by Ly. The equalizer is also associated with an 
equalizer delay ng, which is a design parameter. Generally, allowing ng > 0 improves performance. 
The best equalizers consider several values of ng and choose the best one. 


An ideal equalizer, neglecting noise, would satisfy 
Ly 
S > f[ghin — £) = d[n — ng] (5.88) 
é=0 
forn=0,1,..., L¢+ L. Unfortunately, there are only Z>+ 1 unknown parameters, so (5.88) can be 
satisfied exactly only in trivial cases like flat fading. This is not a surprise, as we know from DSP 
that the inverse of an FIR filter is an IIR filter. Alternatively, we pursue a least squares solution to 
(5.88). 
We write (5.88) as a linear system and then find the least squares solution. The key idea is to write 
a set of linear equations and solve for the filter coefficients that ensure that (5.88) minimizes the 
squared error. First incorporate the channel coefficients into the L>+ L + 1 x L-+ 1 convolution 
matrix: 


He L 5.89 
h{L] hl] oe) 
0 AL] 
h|L) 
Then write the equalizer coefficients in a vector 

f (0) 
f (1 | 
ES (5.90) 

f (Le 


and the desired response as the vector en, with zeros everywhere except for a | in the ny + | position. 


With these definitions, the desired linear system is 


Hf, = Gny: (5.91) 
The least squares solution is 
fisn, = (H*H)'H*e,, (5.92) 
with squared error 
J{na] = e},, (I — H(H*H)~*H")e,,. (5.93) 


The squared error can be minimized further by choosing ng such that J[ng] is minimized. This is 
known as optimizing the equalizer delay. 


Example 5.11 
In this example, we compute the least squares optimal equalizer of length >= 6 for a channel with 


impulse response [0] = 0.5, A[1] =j/2, and h[2] = 0.4 exp(j * 2/5). First we construct the 
convolution matrix 


j/2 05 0 0 O 0 0 
04 j/2 05 0 0 0 0 
0 04 3/2 05 0 O O 
H=/]0 0 04 j/2 05 0 O (5.94) 
0 0 O O04 j/2 05 O 
0 0 O O 04 j/2 0.5 
0 0 0 0 OO 04 j/2 
0 0 0 0 0 0 04 


and use it to compute (5.93) to determine the optimal equalizer length. As illustrated in Figure 
5.17(a), the optimum delay occurs at ng = 5 with J[5] = 0.0266. The optimum equalizer derived from 


(5.92) is 


—0.1051 — j0.1054 
—0.1848 + j0.1665 
0.2100 + j0.3607 

fis.5 = | 0.6065 — j0.2521 | . (5.95) 
—0.2146 — j0.9521 
0.4835 + j0.0926 
—0.1907 + j0.1905 


0.7 


© 

rep) 
=) 
ee) 


S07 
0.5 5 °. 
5s 
és S 0.6 
E 0.4 - 
oO v5) 0.5 
2 = 
oO 
=. 0.3 £04 
w” 
i ©) 
N 03 
0.2 = 
=) 
oO 
® 0.2 
0.1 
0.1 
0 0 
0 2 4 6 8 0 2 4 6 8 
index n 


(a) (b) 
Figure 5.17 The squared error for the equalizer J[n,] in (a) and the equalized channel h[n] * fri” 


in (b), corresponding to the parameters in Example 5.11 
The equalized impulse response is plotted in Figure 5.17(b). 


The matrix H has a special kind of structure. Notice that the diagonals are all constant. This is 
known as a Toeplitz matrix, sometimes called a filtering matrix. It shows up often when writing a 
convolution in matrix form. The structure in Toeplitz matrices also leads to many efficient algorithms 
for solving least squares equations, implementing adaptive solutions, and so on [172, 294]. This 
structure also means that H is full rank as long as at least one coefficient is nonzero. Therefore, the 
inverse in (5.92) exists. 


The equalizer 1s applied to the sampled received signal to produce 
s[n — nq] -> Fra l€ly|n — €. (5.96) 


The delay ng is known and can be corrected by advancing the output by the corresponding number of 
samples. 


An alternative to the least squares equalizer is the LMMSE equalizer. Let us rewrite (5.96) to 
place the equalizer coefficients into a single vector 


s[n — ng] = fy (nl (5.97) 


where y"[n] =[y[n], yin - 1],...,9[n -Ll', 


y(n] = H's{n] + v[n] (5.98) 
with s![n] =[s[n], s[n—1],..., s[n—L]]" and Has in (5.89). We seek the equalizer that minimizes 
the mean squared error 

v 2 cies 
E sin — nq] — fy y[n]| (5.99) 


2 
Assume that s[] is IID with zero mean and unit variance, v[] is IID with variance 7v, and s[n] and 
y[n] are independent. As a result, 


Cyy = E[y[nly*[n]] (5.100) 
= H'H* + 071 (5.101) 
and 
Cy; = E[y|n|s*[n — na] (5.102) 
= H’e,,. (5.103) 


Then we can apply the results from Section 3.5.4 to derive 


fna,MMSE = Cy Cy, (5.104) 


= (H*H + 071)‘ He, (5.105) 


where the conjugate results from the use of conjugate transpose in the formulation of the equalizer in 


(3.459). The LMMSE equalizer gives an inverse that is regularized by the noise power 7 4 which is 
N, if the only impairment is AWGN. This should improve performance at lower values of the SNR 


where equalization creates the most noise enhancement. 


— ( 


2 
The asymptotic equalizer properties are interesting. As 7x  f,, ,MMSE — fisyn,- In the absence 


. 7 . 2 , f oh —? = 3 H*e . 
of noise, the MMSE solution becomes the LS solution. As 7% —? 00, “4/MMSE oy 4 This 
can be seen as a spatially matched filter. 


A block diagram of linear equalization and channel estimation is illustrated in Figure 5.18. Note 
that the optimization over delay and correction for delay are included in the equalization computation, 
though they could be separated in additional blocks. Symbol synchronization is included in the 
diagram, despite the fact that linear equalization can correct for symbol timing errors through 
equalization. The reason is that SNR performance can nonetheless be improved by taking the best 
sample, especially when the pulse shape has excess bandwidth. An alternative is to implement a 
fractionally spaced equalizer, which operates on the signal prior to downsampling [128]. This is 
explored further in the problems at the end of the chapter. 
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Figure 5.18 Receiver with channel estimation and linear equalization 


A final comment on complexity. The length of the equalizer Lyris a design parameter that depends 


on L. The parameter L is the extent of the multipath in the channel and is determined by the bandwidth 
of the signal as well as the maximum delay spread derived from propagation channel measurements. 
The equalizer is an approximate FIR inverse of an FIR filter. As a consequence, performance will 
improve if Lr is large, assuming perfect channel knowledge. The complexity required per symbol, 


however, also grows with L,. Thus there is a trade-off between choosing large Ly to have better 
equalizer performance and smaller Ly to have more efficient receiver implementation. A rule of thumb 
is to take L- to be at least 4Z. 


5.2.3 Linear Equalization in the Frequency Domain with SC-FDE 


Both the direct and the indirect equalizers require a convolution on the received signal to remove the 
effects of the channel. In practice this can be done with a direct implementation using the overlap- 
and-add or overlap-and-save methods for efficiently computing convolutions in the frequency domain. 
An alternative to FIR in the time domain is to perform equalization completely in the frequency 
domain. This has the advantage of allowing an ideal inverse of the channel to be computed. 
Application of frequency-domain equalization, though, requires additional mathematical structure in 
the transmitted waveform as we now explain. 


In this section, we describe a technique known as SC-FDE [102]. At the transmitter, SC-FDE 
divides the symbols into blocks and adds redundancy in the form of a cyclic prefix. The receiver can 
exploit this extra information to permit equalization using the DFT. The result is an equalization 
strategy that is capable of perfectly equalizing the channel, in the absence of noise. SC-FDE is 
supported in IEEE 802.11ad, and a variation is used in the uplink of 3GPP LTE. 


We now explain the motivation for working with the DFT and the key ideas behind the cyclic 
prefix. First, we explain why direct application of the DTFT is not feasible. Consider the received 
signal with intersymbol interference but without noise in (5.87). In the frequency domain 


y (e!27F) = (e!27F) S (e!?7F) ; (5.106) 
The ideal zero-forcing equalizer is 


1 ae 
(5.107) 


f (e?"F) =F h (ei27f) ; 


Unfortunately, it is not possible to directly implement the ideal zero-forcing equalizer in the DITFT 


frequency domain. First of all, the equalizer does not exist at f for which H(e?”) is zero. This can be 
solved by using a pseudo-inverse rather than an inverse equalizer. Several more important problems 
occur as a by-product of the use of the DTFT. It is often not possible to compute the ideal DTFT in 
practice. For example, the whole {s[n]} > S(e?"/) is required, but typically only a few samples of 
s[n] are available. Even when {s[n]! is available, the DTFT may not even exist since the sum may 
not converge. Furthermore, it is not possible to observe over a long interval since h[f] is time 
invariant only over a short window. 

A solution to this problem is to use a specially designed {s[m]} and leverage the principles of the 
discrete Fourier transform. A comprehensive discussion of the DFT and its properties is provided in 
Chapter 3. To review, recall that the DFT is a basis expansion for finite-length signals: 


N-1 

Analysis x |k] = >, x|nje~I kr k=0,1,...,.N—-1 (5.108) 
n=0 
1 N-1 

Synthesis 2[n| = — > x[kli ve" n =0,1,...,N —1. (5.109) 
" k=0 


The DFT can be computed efficiently with the FFT for N as a power of 2 and certain other special 
cases. Practical implementations of the DFT use the FFT. The key properties of the DFT are 
summarized in Table 3.5. 


A distinguishing feature of the DFT is that multiplication in frequency corresponds to circular 
N-1 _InlVN-1 
convolution in time. Consider two sequences {71["]fn=0 and {®2I"|}n=0. Let F ‘y (-) denote the 


1 
DFT operation ona length N signal, and let ~~) denote its inverse. With x,[k] = Fy (x,[n]) and 
x9[k] = Fry [x2[7]], then 


N-1 


a (x1[k] xo[k]) = y;, £1|m]xo[((n — m)) ny]. (5.110) 


m=O 


Unfortunately, linear convolution, not circular convolution, is a good model for the effects of wireless 
propagation per (5.87). 

It is possible to mimic the effects of circular convolution by modifying the transmitted signal with a 
suitably chosen guard interval. The most common choice is what is called a cyclic prefix, illustrated 
in Figure 5.19. To understand the need for a cyclic prefix, consider the circular convolution between 


{s[n]} no {h{4) bio, whi 
a block of symbols of length NV where N > LZ: 1°U'"JSn=0 and the channel é=0, which has been 


zero padded to have length N, that is, A[n] =0 forn © [ZL + 1, N— 1]. The output of the circular 
convolution is 


yln] = >— A[Qs[((n— 2) yn] (5.111) 
é=0 
= Do Alds (((n - 9) (5.112) 
é=0 
n ) ) L , , 
N£\sin —£ ; Jfisintn—ée <n 
_ do eao Pléls[n a + eens Plelsint+n—4 O<n<L (5.113) 
peo hlAls[n — 4 n> D. 
Last L. 
+L,> symbols 
Cyclic 3 
prefix Block of N data symbols ——~————* 


Ne” 


Figure 5.19 The cyclic prefix 
The portion for n > L looks like a linear convolution; the circular wrap-around occurs only for the 
first L samples. 


Now we modify the transmitted sequence to obtain a circular convolution from the linear 
convolution introduced by the channel. Let L, = L be the length of the cyclic prefix. Form the signal 


{w[n| N+L,.-1 


n=0 where the cyclic prefix is 
w[n] =s[fn+N-—L,| n=0,1,...,L.-—1 (5.114) 
and the data is 
w[n] =s[n-—D,| n=L,,£.4+1,...,.Lbe.+N-—1. (5.115) 


After convolution with an L + 1 tap channel, and neglecting noise for the derivation, 
rE 
y(n| = a h[éjwi[n — }. (5.116) 
é=0 


Now we neglect the first L, terms of the convolution, which is called discarding the cyclic prefix, to 
form the new signal: 


~~ 


y{n| _ y|n = ig L] n= 0, 1, cag aN =O (5.117) 


= > hiéjw[n + L, — £. (5.118) 


z— 


To see the effect, it is useful to evaluate ¥|”] for a few values of n: 


L 
910] =) hlquf0+ L. - 4 (5.119) 


=0 
= h[0)w/L,] + h[1}w[L, — 1) +---+ Alé —_ — £ (5.120) 
= ii (O] + A[1]s[N — 1) +---+ -Aldletnv —-L-lj (5.121) 
= Shel ~ t))n] (5.122) 
L 
WL —1 =) Algw[L-1+L.-¢ (5.123) 
=0 
= h[O)w[Z —1+ L,] + Alljw[L -1+ 2, -1] 
+---+hA[Gw[L-14+ 2, -4 (5.124) 
= ~— [L — 1] + A[l|s[Z — 2] +--- + A[L — 1]s[0] + A[As[N — 1] (5.125) 
= Smal L-1-2))y] (5.126) 
f= 3 h[éjw(L + LD, — € (5.127) 
=0 
= h(O]w/L + £,) + Alljw[/L+ 2.-1)+---+h[Qw[/L+ L, - 4 (5.128) 
= h[0)s[¢] + h{1]s[é] + --- + Alé]s[0] (5.129) 
se 
= 5 Al4s(((L- )y] (5.130) 
e=0 
L 
= S°hlds(L - 4. (5.131) 
£=0 


For values of n < L, the cyclic prefix replicates the effects of the linear convolution observed in 
(5.113). For values of N > L, the circular convolution becomes a linear convolution, also as expected 
from (5.113) because L < N. In general, the truncated sequence satisfies 


L 
alr] = S— hlé|s[((n — 2))y]. (5.132) 
£=0 


Therefore, thanks to the cyclic prefix, it is possible to implement frequency-domain equalization 
simply by computing y[A] = Fn (yn ) s[k] = Fy (s[n]), and then 


s[n] = Fy’ Ga (5.133) 
_ gif Fn(yln)) |, 
= Fn (ee. (5.134) 


This 1s the key idea behind the SC-FDE equalizer. 
The cyclic prefix also acts as a guard interval, separating the contributions of different blocks of 
ze N-1 N-1 
data. To see this, note that {9"| }n—o depends only on symbols {s[n]}n—0 and not symbols sent in 
previous blocks like s[”] for n <0 or n > N. Zero padding can alternatively be used as a guard 
interval, as explored in Example 5.12. 


Example 5.12 
Zero padding is an alternative to the cyclic prefix [236]. With zero padding, L, zero values replace 
the cyclic prefix where 


vin|—O8 A—=OD Isc, [L, —1. (5.135) 


The data is encoded as in (5.115) and L < L,. We neglect noise for this problem. 
* Show how zero padding enables successive decoding of s[n] from y[n]. Hint: Start with n = 0 
and show that s[0] can be derived from y[L,]. Let 3|0| denote the detected symbol. Then show 


how, by subtracting off 8 [0] you can detect 5|1] Then assume it is true for a given n and show 
that it works for n+ 1. 


Answer: To decode s[0], we look at the expression of y[L,]. After expanding and simplifying, 
because of the zeros, we obtain y[L,] = A[O]w[L,] = A[0]s[0]. Because A[0] is known, we can 
decode s[0] as 


(5.136) 


To decode s[1], we use y[L, + 1] and the previously detected value of 3|0| Because 
y[Le +1] = hfOjw[L.+1) +A[l}w[L-] = h[O]s[1]+h{1]s[0) ana assuming the detected 
symbol is correct, 


Le + 1] — h{1]3 (0) 


oe —— 
s[1] = nj0) (5.137) 


~ P| 
Finally, assume that {s[n]} n=0 has been decoded; then 


L 
y[L. +k] = > hléjw[L. +k — | (5.138) 


é=0 
L 
= h[0|s[k] + 5 hléjs[k — ¢]. (5.139) 
f=] 
We thus can decode s[k] as follows: 
yiLe +k k-—é . 
ylLe +k] — Dey hidistk — — 


ale = h{0) 


The main drawback of this approach is that it suffers from error propagation: a symbol error in 
5|k'] affects the detection of all symbols after k. 
* Consider the following alternative to discarding the cyclic prefix: 


yln] =y[n+L,]|+yln+N+4+0,| for n=0,1,...,2-1 (5.141) 
y[n] = y[n+L,] for n=L,£L+4+1,...,N—-1. (5.142) 


Show how this structure also permits frequency domain equalization (neglect noise for the 
derivation). 


Answer: Forn=0,1,...,L-—1: 


y[n] =y[n+ L.] +y[n+N+L, J (5.143) 
=> ald [Elwin + L. — €| + < h\ ela 
£=0 f=n+1 
, L 
+ 3 h[é)w[n+W+ L, — é] + 5. h[éjw[n + N +L, — € (5.144) 
es f=n+1 
L 
= ay h[éjs[n — é) + + >> hléjs[n + N — @ (5.145) 
£=0 é=n+1 
= - 5" Alas (n — £))n] (5.146) 


£=0 


where the cancellation is due to the cyclic prefix. Forn=L, L—-1,...,N-1: 


b 
a = », h[é]w[n + L, — € (5.147) 


&£=0 

= s hlé|s[n — €] (5.148) 
&£=0 

= SP hile) (n — £))n]. (5.149) 


£=0 


Therefore, !/ "| is the same as (5.132), and the same equalization as SC-FDE can be applied, 
with a little performance penalty due to the double addition of noise. Zero padding has been 
used in UWB (ultra-wideband) [197] to make multiband OFDM easier to implement, and in 
millimeter-wave SC-FDE systems [82], where it allows for reconfiguration of the RF 
parameters without signal distortion. 


Noise has an adverse effect on the performance of the SC-FDE equalizer. Now we examine what 
happens to the effective noise variance after equalization. In the presence of noise, and with perfect 
channel state information, 


s[n] = s[n] + Fx" (v[k]/h[k]). (5.150) 


The second term is augmented during the equalization process in what is called noise enhancement. 


Let “."| be the enhanced noise component. It remains zero mean. To compute its variance, we expand 
the enhanced noise as 


N-I1 N-1 — 425 bry 
~ ! jp Ree: u[mle ~— j2tkn a4 
vin| = — ) = JN 5.151 
v|n] N 2. hie € (5.151) 


and then compute 


N-1 ~WN-1 he 
ae ty ] mje seim 
E[v[n|v*[n]] = E 7 >. dim =0 VM] el ekin 


sv ky =0 h[ky| 


N-1 N-1 j2tkeme 
LS Lmao 1 tral a8 ham (5.152) 
N h*[ko] 
ko=0 
Pe N-1 N-1 N-1 N-1 5[my ma] 
= v = ~ ese kimy j= kome 27 hynny —j=tkong 5153 
N? Do. iki lF=Thal| ern a 
k,=0 ke =0 m,=0 m2=0 
o2 N-1 N-1N-1 1 . 
= v ede (ki —k2)m 55 a y Kins ¢ »— JF ke ne 5.154 
N? hiki]h* [ho] aie 
k,=0 ko=0 m=0 
a2 N-1 1 
Jan ji An i xg kn eRe 
N ih] |2- : (5.155) 
k=0 
> N-1 
a 1 B. aibe 
= .156 
N & |h{k]|? (5.156) 
k=0 


where the results follow from the IID property of v[”] and repeated applications of the orthogonality 
of discrete-time complex exponentials. The main conclusion is that we can model (5.150) as 


s[n] = s[n] + v[n] (5.157) 


where v[7] is AWGN with variance given in (5.156), which is the geometric mean of the inverse of 
the channel in the frequency domain. We contrast this result with that obtained using OFDM in the next 
section. 

An implementation of a QAM system with frequency-domain equalization is illustrated in Figure 
5.20. The operation of collecting the input bits into groups of N symbols is given by the serial-to- 
parallel converter. A cyclic prefix takes the NV input symbols, copies the last L, symbols to the 
beginning, and outputs VN + L, symbols. The resulting symbols are then converted from parallel to 
serial, followed by upsampling and the usual matched filtering. The serial-to-parallel and parallel-to- 
serial blocks can be implemented from a DSP perspective using simple filter banks. 


|! Parallel 


oy) serial 
+Z..): 


er 
M, 
Figure 5.20 QAM transmitter for a single-carrier frequency-domain equalizer, with CP denoting 
cyclic prefix 


Compared with linear equalization, SC-FDE has several advantages. The channel inverse can be 
done perfectly since the inverse is exact in the DFT domain (assuming, of course, that none of the 


h[k] are zero), and the time-domain equalizer is an approximate inverse. SC-FDE also works 
regardless of the value of L as long as L, = L. The equalizer complexity is fixed and is determined by 


the complexity of the FFT operation, proportional to V log, NV. The complexity of the time-domain 
equalizer is a function of K and generally grows with L (assuming that K grows with L), unless it is 
itself implemented in the frequency domain. As a general rule of thumb it becomes more efficient to 
equalize in the frequency domain for LZ around 5. 

The main parameters to select in SC-FDE are N and L,. To minimize complexity it makes sense to 
take N to be small. The amount of overhead, though, is L,/(N + L,). Consequently, taking N to be large 
reduces the system overhead incurred by redundancy in the cyclic prefix. Too large an N, however, 
may mean that the channel varies over the N symbols, violating the LTT assumption. In general, L, is 
selected to be large enough that L < L, for most channel realizations, throughout the use cases of the 
wireless system. For example, for a personal area network application, indoor channel measurements 
may be used to establish the power delay profile and other channel statistics from which a maximum 
value of L, can be derived. 


5.2.4 Linear Equalization in the Frequency Domain with OFDM 

The SC-FDE receiver illustrated in Figure 5.21 performs a DFT on a portion of the received signal, 
equalizes with the DFT of the channel, and takes the IDFT (inverse discrete Fourier transform) to 
form the equalized sequence 8|N| This offloads the primary equalization operations to the receiver. In 
some cases, however, it is of interest to have a more balanced load between transmitter and receiver. 


A solution is to shift the IDFT to the transmitter. This results in a framework known as multicarrier 
modulation or OFDM modulation [63, 72]. 
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Figure 5.21 Receiver with cyclic prefix removal and a frequency-domain equalizer 


Several wireless standards have adopted OFDM modulation, including wireless LAN standards 
like IEEE 802.11a/b/n/ac/ad [279, 290], fourth-generation cellular systems like 3GPP LTE [299, 16, 
93, 253], digital audio broadcasting (DAB) [333], and digital video broadcasting (DVB) [275, 104]. 


In this section, we describe the key operations of OFDM at the transmitter as illustrated in Figure 


5.22 and the receiver as illustrated in Figure 5.23. We present OFDM from the perspective of having 
already derived SC-FDE, though historically OFDM was developed several decades prior to SC- 
FDE. We conclude with a discussion of OFDM versus SC-FDE versus linear equalization techniques. 
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Figure 5.22 Block diagram of an OFDM transmitter. Often the upsampling and digital pulse 
shaping are omitted when rectangular pulse shapes are used. The inverse DFT is implemented 
using an inverse fast Fourier transform (IFFT). 
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Figure 5.23 Block diagram of an OFDM receiver. Normally the matched filtering and symbol 
synchronization functions are omitted. 


The key idea of OFDM, from the perspective of SC-FDE, is to insert the IDFT after the first serial- 


N-1 
to-parallel converter in Figure 5.22. Given {s[n]}n=0 anda cyclic prefix of length L,, the transmitter 
produces the sequence 


wn] — WN >; s{m]e!?" me mid n= Q), tees N + i. a 1, (5.158) 


which is passed to the transmit pulse-shaping filter. The signal w[7] satisfies w[n] = w[n + N] for n= 
0,1,...,£,—1; therefore, it has a cyclic prefix. The samples {w[Z,], w[Z, + 1],...,w[N+L, - 


fly V—1 
1]} correspond to the IDFT of {|| }n=0 . Unlike the SC-FDE case, the transmitted symbols can be 
considered to originate in the frequency domain. We do not use the frequency-domain notation for s[n] 
for consistency with the signal model. 


We present the transmitter structure in Figure 5.22 to make the connection with SC-FDE clear. In 
OFDM, though, it is common to use rectangular pulse shaping where 


it 1 ] 
Jtx(t) = Arp rect (x) ; (5.159) 


This function is not bandlimited, so it cannot strictly speaking be implemented using the upsampling 
followed by digital pulse shaping as shown in Figure 5.22. Instead, it 1s common to simply use the 
“stair-step” response of the digital-to-analog converter to perform the pulse shaping. This results in a 
signal at the output of the DAC that takes the form 


N/2-1 
es m({t—L-T') ; . 
w(t) = — pa s[m]e!?* ~~ TN ~ for te (0, (N+ L,)T}. (5.160) 


m=—N/2 


This interpretation shows how symbol s[m] rides on continuous-time carrier exp(j21tm/NT) at 
baseband with a carrier of frequency 1/N7, whichis also called the subcarrier bandwidth. This is one 
reason that OFDM is also called multicarrier modulation. We write the summation as shown in 
(5.160) to make it clear that frequencies around... , N—3,N-—2,N- 1,0, 1,2,... are low 
frequencies whereas those around N/2 are high frequencies. Subcarrier n = 0 is known as the DC 
subcarrier, which is often assigned a zero symbol to avoid DC offset issues. The subcarriers near N/2 
are often assigned zero symbols to facilitate spectral shaping. 


Now we show how OFDM works. Consider the received signal as in (5.116). Discard the first L, 
samples to form 


y[n] =y[n + D.] n=0,1,...,N—1. (5.161) 


Inserting (5.158) for w[n] and interchanging summations gives 


L 
ile] = Yo hldw[n + Le (5.162) 
c=) 
1 L N-I] —s 
~ N hd DL slime (5.163) 
&£=0 m=0 
1 L N-1 
= Dh DL aime?" Fe“ Px 5.164) 
; £=0 m=0 
1 N-1 L 
=a DD nlc" ® s[mjel?n (5.165) 
“" m=0 &=0 
h[{m] 


= Fx" (h[m]s[m)). (5.166) 


Therefore, taking the DFT gives 
y(n] = h[n]}s[n| (5.167) 


forn=0,1,...,N—1, and equalization simply involves multiplying by h![n]. Low SNR 
performance could be improved by multiplying by the LMMSE equalizer | h[n]|° + a5) *h*[n] 
instead of by h ![n]. In terms of time-domain quantities, 
¥ Fn (yin}) ee; 
s|n| = ————-.. (5.168) 
tn] n(h|n]) 
The effective channel experienced by s[”] is h[n], which is a flat-fading channel. OFDM effectively 
converts a problem of equalizing a frequency-selective channel into that of equalizing a set of parallel 
flat-fading channels. Equalization thus simplifies a great deal versus time-domain linear equalization. 


The terminology in OFDM systems is slightly different from that in single-carrier systems. Usually 
in OFDM, the collection of samples including the cyclic prefix {w|n] } nat is called an OFDM 
symbol. The constituent symbols {s(n} aot are called subsymbols. The OFDM symbol period is (NV 
+ L,)T, and Tis called the sample period. The guard interval, or cyclic prefix duration, is L,7. The 
subcarrier spacing is 1/(NT) and is the spacing between adjacent subcarriers as measured ona 
spectrum analyzer. The passband bandwidth is 1/7, assuming the use of a sinc pulse-shaping filter 
(which is not common; a rectangular pulse shape is used along with zeroing certain subcarriers). 

There are many trade-offs associated with selecting different parameters. Making N large while L, 
is fixed reduces the fraction of overhead N/(N +L,) due to the cyclic prefix. A larger N, though, means 
a longer block length and shorter subcarrier spacing, increasing the impact of time variation in the 
channel, Doppler, and residual carrier frequency offset. Complexity also increases with larger values 
as the complexity of processing per subcarrier grows with log, N. 


Example 5.13 

Consider an OFDM system where the OFDM symbol period is 3.2, the cyclic prefix has length L, 
= 64, and the number of subcarriers is N = 256. Find the sample period, the passband bandwidth 
(assuming that a sinc pulse-shaping filter is used), the subcarrier spacing, and the guard interval. 


Answer: The sample period T satisfies the relation T (256 + 64) = 3.2us, so the sample period is 7 
= 10ns. Then, the passband bandwidth is 1/7 = 100MHz. Also, the subcarrier spacing is 1/(NT) = 
390.625kHz. Finally, the guard interval is L,.T = 640ns. 


Noise impacts OFDM differently than SC-FDE. Now we examine what happens to the effective 
noise variance after equalization. In the presence of noise, and with perfect channel state information, 


s[n] = s[n] + “a (5.169) 


where v[n] = Fy (v[n]). Because linear combinations of Gaussian random variables are Gaussian, 


y ) 
and the DFT is an orthogonal transformation, v[n] remains Gaussian with zero mean and variance 7 v. 


Therefore, v[n]/h[n] is AWGN with zero mean and variance 7 »/lh(n]|?. Untike SC-FDE, the noise 
enhancement varies with different subcarriers. When h[7] is small for a particular value of n (close 
to a null in the spectrum), substantial noise enhancement is created. With SC-FDE there is also noise 
enhancement, but each detected symbol sees the same effective noise variance as in (5.156). With 
coding and interleaving, the error rate differences between SC-FDE and OFDM are marginal, unless 
other impairments like low-resolution DACs and ADCs or nonlinearities are included in the 


OFDM is in general more sensitive to RF impairments compared with SC-FDE and standard 
complex pulse-amplitude modulated signals. It is sensitive to nonlinearities because the ratio between 
the peak of the OFDM signal and its average value (called the peak-to-average power ratio) is higher 
in an OFDM system compared with a standard complex pulse-amplitude modulated signal. The 
reason is that the IDFT operation at the transmitter makes the signal more likely to have all peaks. 
Signal processing techniques can be used to mitigate some of the differences [166]. OFDM signals 
are also more sensitive to phase noise [264], gain and phase imbalance [254], and carrier frequency 
offset [75]. 


The OFDM waveform offers additional degrees of flexibility not found in SC-FDE. For example, 
the information rate can be adjusted to current channel conditions based on the frequency selectivity 
of the channel by changing the modulation and coding on different subcarriers. Spectral shaping is 
possible by zeroing certain symbols, as already described. Different users can even be allocated to 
subcarriers or groups of subcarriers in what is called orthogonal frequency-division multiple access 
(OFDMA). Many systems like IEEE 802.11a/g/n/ac use OFDM exclusively for transmission and 
reception. 3GPP LTE Advanced uses OFDM on the downlink and a variation of SC-FDE on the 
uplink where power backoff is more critical. IEEE 802.1lad supports SC-FDE as a mandatory mode 
and OFDM ina higher-rate optional mode. Despite their differences, both OFDM and SC-FDE 
maintain an important advantage of time-domain linear equalization: the equalizer complexity does 
not scale with L, as long as the cyclic prefix is long enough. Going forward, OFDM and SC-FDE are 
likely to continue to see wide commercial deployment. 


5.3 Estimating Frequency-Selective Channels 


When developing algorithms for equalization, we assumed that the channel state information ¢/2¢ he =0 
was known perfectly at the receiver. This 1s commonly known as genie-aided channel state 
information and is useful for developing analysis of systems. In practice, the receiver needs to 
estimate the channel coefficients as engineering an all-knowing genie has proved impossible. In this 
section, we describe one method for estimating the channel in the time domain, and another for 
estimating the channel in the frequency domain. We also describe an approach for direct channel 
equalization in the time domain, where the coefficients of the equalizer are estimated from the training 
data instead of first estimating the channel and then computing the inverse. All the proposed methods 
make use of least squares. 


5.3.1 Least Squares Channel Estimation in the Time Domain 

In this section, we formulate the channel estimation problem in the time domain, making use of a 
known training sequence. The idea is to write a linear system of equations where the channel 
convolves only the known training data. From this set of equations, the least squares solution follows 
directly. 


Ner—1 
Suppose as in the frame synchronization case that {t!n]}n“o isa known training sequence and 
that s[n] = ¢[n] form =0,1,...,N,,— 1. Consider the received signal in (5.87). We have to write 
y[n] only in terms of t[n]. The first few samples depend on symbols sent prior to the training data. For 
example, 


y[0] = ¥ hléjs[n — €] + v[n] (5.170) 
= h{0]t/0] + A[1]s[—1] + h[2]s[—2] +... + A[L]s[/—L] +v[n). (5.171) 
cm? 


includes unknown symbols 


Since prior symbols are unknown (they could belong to a previous packet or a message sent to 
another user, or they could even be zero, for example), they should not be included in the formulation. 
Taking n © [L, N,,— 1], though, gives 
I i 
yn] = ¥— h[Gt[n — 4] + fn, (5.172) 
(=0 

which is only a function of the unknown training data. We use these samples to form our channel 
estimator. 

Collecting all the known samples together, 


y|L] 
y[L + 1] 
vlNe = L| 
i) bi] = +(0) h{0] v(L) 
| |) t(1] h{1] v[L +1] | 
= . .|* (5.173) 
Nee —1) #[M—2) --- t{Me—-1—-Z)| |atz]| lot 1) 


which is simply written in matrix form as 
y = Th-+ Vv. (5.174) 


As a result, this problem takes the form of a linear parameter estimation problem, as reviewed in 
Section 3.5. 


The least squares channel estimate, which is also the maximum likelihood estimate since the noise 
is AWGN, is given by 


his = (T*T)~'T*y, (5.175) 


assuming that the Toeplitz training matrix T is invertible. Notice that the product (T*T) 'T can be 
computed offline ahead of time, so the actual complexity is simply a matrix multiplication. 


To be invertible, the training matrix must be square or tall, which requires that 


Nip BAA PO, OE SDE. (5.176) 


Generally, choosing N,, much larger than L + 1 (the length of the channel) gives better performance. 
The full-rank condition can be guaranteed by ensuring that the training sequence is persistently 
exciting. Basically this means that it looks random enough. A typical design objective is to find a 
sequence such that T*T is (nearly) a scaled identity matrix I. This ensures that the noise remains 
nearly IID among other properties. Random training sequences with sufficient length usually perform 
well whereas ¢[n] = 1 will fail. Training sequences with good correlation properties generally satisfy 
this requirement, because the entries of T*T are different (partial) correlations of ¢[7]. 


Special training sequences can be selected from those known to have good correlation properties. 


Np-1 
One such design uses a cyclically prefixed training sequence. Let {P|n|}n20 bea sequence with 
good periodic correlation properties. This means that the periodic correlation 


Ny-1 [. sey fa i _ 2, 
R,|k] - pam p\n|p*{((n + k))N,| satisfies the property that RDLA]| is small or zero for k > 0. 


7. en Wises  ) Np-—1 
Construct the training sequence {t[n]},“o by prefixing {PIN|Fn20 with Pin}, =Np~L~1, which 
gives an NV, = N, + L training sequence. With this construction 


p\0] pPINp-—1] --- p[Np — L] 
p\1| p|0] --» pIN, -L+4+1] 
T = . . _ (5.177) 
pINno—1]| piINp>—2] --- pIN,-L-1 


Then [T*T], ¢ =, [A — €] contains lags of the periodic correlation function. Therefore, sequences 
with good periodic autocorrelation properties should work well when cyclically prefixed. 


In the following examples, we present designs of T based on sequences with perfect periodic 
autocorrelation. This results in T*T = N,I and a simplified least squares estimate of 


hre = T*v(N..)7! 

his = T*y(Np)~”, We consider the Zadoff-Chu sequences [70, 117] in Example 5.14 and Frank 
sequences [118] in Example 5.15. These designs can be further generalized to families of sequences 
with good cross-correlations as well; see, for example, the Popovic sequences in [267]. 


Example 5.14 
Zadoff-Chu sequences are length N, sequences with perfect periodic autocorrelation [70, 117], which 
means that R,[k] = 0 for k # 0. They have the form 


. M xn? = ; 
pin} =e’ %» for Np even (5.178) 
» Ma(n)(n+1) 


p[inj=e for N, odd (5.179) 
where & is an integer that is relatively coprime with N,, which could be as small as 1. Chu 
sequences are drawn from the V,,-PSK constellation and have length N,. 

Find a Chu sequence of length N,, = 16. 


Answer: We need to find an / that is coprime with Ns Since 16 has divisors of 2, 4, and 8, we 
can select any other number. For example, choosing M = 3 gives 


pin] = et (5.180) 


which gives the sequence 


c :3r >on - So iis - 3 3x 
{p[n]}2_) = {Le! ge 2 pore BeBe ae AIO, Pecan 


Sor 


_3e ;3n ;lin _jSe «30 ;3n 
e 16 e) 4 e! 16> 54] aye 9 16  e! 4 .e 16 \ : (5.181) 


With this choice of training, it follows that T*T = NL 


Example 5.15 
The Frank sequence [118] gives another construction of sequences with perfect periodic correlation, 
which uses a smaller alphabet compared to the Zadoff-Chu sequences. For n = mg + k, 


9,- rk 


p{mq + k] = rr O<km<q (5.182) 


where r must be coprime with g, g is any positive integer, and n © [0, g? — 1]. The length of the 
Frank sequence is then N, = g° and comes froma g-PSK alphabet. 

Find a length 16 Frank sequence. 

Answer: Since the length of the Frank sequence is N,, = g°, we must take g = 4. The only viable 
choice of 7 is 3. With this choice 


p(m4 + k] = e’* hr (5.183) 
for m © [0, 3] and k © [0, 3]. This gives the sequence 
{p[n] 32-9 = {1,1,1,1,1,-j, -1,j,1, -1,1, -1, 1,j, -1, -j}- (5.184) 


The Frank sequence also satisfies T but does so with a smaller constellation size, in this case 4-PSK. 
Note that a 4-QAM signal can be obtained by rotating each entry by exp(jz/4). 


A variation of the cyclic prefixed training sequence design uses both a prefix and a postfix. This 
approach was adopted in the GSM cellular standard. In this case, a ome with good periodic 
correlation properties {p[n]} AL is prefixed by L samples pin] ae N, ~L~1 and postfixed by the 
samples {p|n} “= This results in a sequence with length N,, + 2L. The motivation for this design is 
that 

Np-1 
Ryt|m) =S p[njt*[n +m] = R,[m] for me [-L4+1,L-1]). (5.185) 
n=0 


With perfect periodic correlation, the cross-correlation between p[n] and ¢[7] then has the form of 


PE caren th Di conan AD ceceroy  Mhvasau, ut, (5.186) 


which looks like a discrete-time delta function plus some additional cross-correlation terms 
(represented by u; these are the values of R,,[k] for k > L). As a result, correlation with p[7], 


equivalently convolving with p*[(V, — 1 + 1)], directly produces an estimate of the channel for 
certain correlation lags. For example, assuming that s[n] = ¢[n] form =0,1,...,M,— 1, 


y[n] * p*|(Np — n+ 1)] = Nph[n] + p*[Np — n+ 1] * v[n] (5.187) 


forn=N,,N,+1,...,N, +L. In this way, itis possible to produce an estimate of the channel just by 


correlating with the sequence {p[n]} that 1s used to compose the training sequence {ft[m]}. This has 
the additional advantage of also helping with frame synchronization, especially when the prefix size 
is chosen to be larger than Z, and there are some extra zero values output from the correlation. 


Another training design is ase on sora oe ny sequences [129]. This approach is used 


in IEEE 802.ad [162]. Let {a[n| Ico 0 and 1[n] hee =0 be length N, Golay complementary 
sequences. Such sequences satisfy a special periodic correlation property 
N,z-1 


> a|n|a* [((n +k))n,| + b[n]b* [((n + k))n,] = 2N,6[((k)) ny, ]- (5.188) 


n=0 


Furthermore, they also satisfy the aperiodic correlation property 
Ng—k-1 
S> alnja* [n + k] + b[n]b* [n + k] = 2N,6[k]. (5.189) 


n=) 


It turns out that the family of Golay complementary sequences is much more flexible than sequences 
with perfect periodic correlation like the Frank and Zadoff-Chu sequences. In particular, it is possible 
to construct such pairs of sequences from BPSK constellations without going to higher-order 
constellations. Furthermore, such sequence pairs exist for many choices of lengths, including for any 
Nz as a power of 2. There are generalizations to higher-order constellations. Overall, the Golay 


complementary approach allows a flexible sequence design with some additional complexity 
advantages. 


Nz-1 
Now construct a training sequence by taking {a[n]};,40 , with a cyclic prefix and postfix each of 
=] 
length L, and naturally N, > L. Then append another sequence constructed from {b[n| yan 0 witha 


cyclic prefix and postfix: as well. This gives a length V,, = 2, + 4Z training sequence. Assuming that 
Sim] =i[n] form = 0, 1,..+,N,— 1, 


Nz—-1 Ne 

© ylk + La* [((k+n))n,] + D5 ylk +3L + Nglb* [((k +n)) ny] 

k=0 k=0 

= 2N,h|n] + noise (5.190) 
forn=0,1,..., No ~ 1. As a result, channel estimation can be performed directly by correlating the 


received data with the complementary Golay pair and adding the results. In IEEE 802.11ad, the Golay 
complementary sequences are repeated several times, to allow averaging over multiple periods, 
while also facilitating frame synchronization. They can also be used for rapid packet identification; 
for example, SC-FDE and OFDM packets use different sign patterns on the complementary Golay 
sequences, as discussed in [268]. 

An example construction of binary Golay complementary sequences is provided in Example 5.16. 
We provide an algorithm for constructing sequences and then use it to construct an example pair of 
sequences and a corresponding training sequence. 


Example 5.16 
There are several constructions of Golay complementary sequences [129]. Let a,,[n] and b,,[n] 


denote a Golay complementary pair of length 2”. Then binary Golay sequences can be constructed 
recursively using the following algorithm: 


ag|n] = dn] (5.191) 
bo[n] = d{n] (5.192) 
Qm+i(n| = a(n] n € [0,2™ — 1] (5.193) 
Am+i|[n] = bm[n — 2] ne en 2s ill] (5.194) 
bm+1[n] = am[n] n€ [0,2™ —1] (5.195) 
bm+i[n] = —bm[n — 2™] ne [2", 24! — 1] (5.196) 


Find a length V, = 8 Golay complementary sequence, and use it to construct a training sequence 
assuming L = 2. 
Answer: Applying the recursive algorithm gives 
{a1 [rn] }n-0 = {1,1} ( 
{61 [n]}n=o = {1,-1} ( 
{a2[n]}n=o = {1,1,1,-1} ( 
{bo [n]} =o = {1,1,-1,1} (5. 
( 
( 


— 


~ 


fag|[n]}¢ 4 = {1, 1, 1;=1,1,1;—1,13 
{o3[n]}0-9 = {1,1,1,-1,—-1,-1, 1, -1}. 


The resulting training sequence, composed of Golay complementary sequences with length L = 2 
cyclic prefix and postfix, 1s 


iat a8 Ta a a a a a ae 
(5.203) 


Example 5.17 


IEEE 802.15.3c 60GHz is a standard for high-data-rate WPAN. In this example, we review the 
preamble for what is called the high-rate mode of the SC-PHY. Each preamble has a slightly different 
structure to make them easy to distinguish. The frame structure including the preamble is shown in 
Figure 5.24. Let aj7g and bj denote vectors that contain length 128 complementary Golay sequences 


(in terms of BPSK symbols), and aj<¢ and by5, vectors that contain length 256 complementary Golay 
sequences. From the construction in Example 5.16, aj5¢ = [)79; Dy2g] and by56 = [a)}29; —b,22] where 


we use ; to denote vertical stacking as in MATLAB. The specific Golay sequences used in IEEE 
802.15.3c are found in [159]. The preamble consists of three components: 


1. The frame detection (SYNC) field, which contains 14 repetitions of a, and is used for frame 
detection 

2. The start frame delimiter (SFD), which contains [a1 93 @1293 —@)2g3 @j2g] and is used for frame 
and symbol synchronization 

3. The channel estimation sequence (CES), which contains [b1 93 B2563 82563 Bos63 4256] and is used 
for channel estimation 

¢ Why does the CES start with bj? 
Answer: Since 456 = [81293 Dj28], Dy2g acts as a cyclic prefix. 

¢ What is the maximum channel order supported? 
Answer: Given that bj. is a cyclic prefix, then L, = 128 is the maximum channel order. 

¢ Give a simple channel estimator based on the CES (assuming that frame synchronization and 
frequency offset correction have already been performed). 
Answer: Based on (5.190), 


255 


Rid = S~ yin + 128]aSsq[((€ + n))a56] + yln + 384]bS5[((€ + n)) 256 


n=0 


+ y|n + 640}a556[((¢ + n))o56 


+ y[n + 896]b5<6[((4 + m))256]- (5.204) 
tof t=O) Ly ow ey dg: 


1760Mchips/s, ~1.96ys 


Frame detection Start frame Channel 
(SYNC) delimiter (SFD) estimation (CES) 


Figure 5.24 Frame structure defined in the IEEE 802.15.3c standard 


In this section, we presented least squares channel estimation based on a training sequence. We 


also presented several training sequence designs and showed how they can be used to simplify the 
estimation. This estimator was constructed assuming that the training data contains symbols. This 
approach is valid for standard pulse-amplitude modulated systems and SC-FDE systems. Next, we 
present several approaches for channel estimation that are specifically designed for OFDM 
modulation. 


5.3.2 Least Squares Channel Estimation in the Frequency Domain 


Channel estimation is required for frequency-domain equalization in OFDM systems. In this section, 
we describe several approaches for channel estimation in OFDM, based on either the time-domain 
samples or the frequency-domain symbols. 


First, we describe the time-domain approach. We suppose that N,, = N, so the training occupies 


exactly one complete OFDM symbol. It is straightforward to generalize to multiple OFDM symbols. 
Ni rr” l 
Suppose that the training {tin no is input to the IDFT, and a length L, cyclic prefix is appended, 


{wln]}d "nthe time-doma {win pea 
to create 1!"Jfn=0 —. In the time-domain approach, the IDFT output samples !'!!") fn=0 are 


used to estimate the time-domain channel coefficients, by building T in (5.173) from w[n] and then 
estimating the channel from (5.175). The frequency-domain channel coefficients are then obtained 

yk) a BIA wen Oaekhin — ' 
from H[k] = dip2o Alé] exp(—j27kl/N) The main disadvantage of this approach is that the nice 
properties of the training sequence are typically lost in the IDFT transformation. 


Now we describe an approach where training is interleaved with unknown data in what are called 
pilots. Essentially the presence of pilots means that a subset of the symbols on a given OFDM symbol 
are known. With enough pilots, we show that it is possible to solve a least squares channel estimation 
problem using only the data that appears after the IDFT operation [338]. 


Suppose that training data is sent ona subset of all the carriers in an OFDM symbol; using all the 
symbols is a special case. Let P= {p), Po, ..., pp } denote the indices of the subcarriers that contain 


known training data. The approach we take is to write the observations as a function of the unknown 
channel coefficients. First observe that by writing the frequency-domain channel in terms of the time- 
domain coefficients, 


y[pi] = h[pijt|ps| + v[pi] (5.205) 
h|0| 
_— yet) | Pll | 
= E e nN... ew Be KE] _ | tipi] + vipa). (5.206) 
h|L) 


Collecting the data from the different pilots together: 


y|p2| 0 tp] 0 
y|pp] tIpP| 
‘seanmn soumee! =e’ 
: P 
1 pian eth) Trio] v[p1) 
1 ee eee T Lat] | vipol 
a | | | : 1. (5.207) 
1 e— jan >t eee eo i2n PR h{L} vipM 
SS 
rE h 


This can now be written compactly as 


y = PEh+ v. (5.208) 


The matrix P has the training pilots on its diagonal. It is invertible for any choice of nonzero pilot 
symbols. The matrix E is constructed from samples of DFT vectors. It is tall and full rank as long as 
the number of pilots P > Z + 1. Therefore, with enough pilots, we can compute the least squares 
solution as 


his = (E*P*PE)~!E*P’y. (5.209) 


This approach allows least squares channel estimation to be applied based only on training at known 
pilot locations. The choice of pilot sequences with pilot estimation is not as important as in the time- 
domain approach; the selection of pilot locations is more important as this influences the rank 
properties of E. The approach can be extended to multiple OFDM symbols, possibly at different 
locations for improved performance. 


An alternative approach for channel estimation in the frequency domain is to interpolate the 
frequency-domain channel estimates instead of solving a least squares problem [154]. This approach 
makes sense when training is sent on the same subcarrier across OFDM symbols, to allow some 
additional averaging over time. 

Consider a comb-type pilot arrangement where P pilots are evenly spaced and N, = N/P. This 
uniform spacing is optimum under certain assumptions for LMMSE channel estimation in OFDM 


~ 


systems [240] and for least squares estimators if P*P is a scaled identity matrix. Let hp [cNe] be the 
channel estimated at the pilot locations for c =0,1,..., P—1 with hy [0] a hp|N] and 


hp|—Ne| = hp|N — Ne] to account for wrapping effects. This estimate is obtained by averaging over 
multiple observations of y[p;] 1n multiple OFDM symbols. Using second-order interpolation, the 
missing channel coefficients are estimated as [77] 


(<i on, 
= on Pelle - 1)N] 


f t BE gen ESD ie, - 
= (x ~ 1) (+ + 1) hp[eNe] + aN hp[(e + 1) Ne]. (5.210) 


In this way, the frequency-domain channel coefficients may be estimated directly from the pilots 
without solving a higher-dimensional least squares problem. With a similar total number of pilots, the 
performance of the time-domain and interpolation algorithms can be similar [154]. 


There are other approaches for channel estimation that may further improve performance. For 
example, if some assumptions are made about the second-order statistics of the channel, and these 
statistics are known at the receiver, then LMMSE methods may be used. Several decibels of 
performance improvement, in terms of uncoded symbol error rate, may be obtained through MMSE 
[338]. Pilots may also be distributed over multiple OFDM symbols, on different subcarriers in 
different OFDM symbols. This allows two-dimensional interpolation, which is valuable in channels 
that change over time [196]. The 3GPP LTE cellular standard includes pilots distributed across 
subcarriers and across time. 


Example 5.18 


In this example, we explore the preamble structure used in IEEE 802.11a, shown in Figure 5.25, and 
explain how it is used for channel estimation [160, Chapter 17]. A similar structure is used in IEEE 
802.11g, with some backward compatibility support for 802.11b, and in IEEE 802.11n, with some 
additions to facilitate multiple-antenna transmission. The preamble in IEEE 802.11a consists of a 
short training field (STF) and a channel estimation field (CEF). Each field has the duration of two 
OFDM symbols (8s) but is specially constructed. In this problem, we focus on channel estimation 
based on the CEF, assuming that frame synchronization and carrier frequency offset correction have 
been performed. 


16s 4us 


PLCP preamble SIGNAL DATA 
12 symbols One OFDM symbol Variable number of OFDM symbols 
fa[e|e[e|ofals[e|e[| oz | & | & 


Short training field 8s Channel estimation field 81s 
Figure 5.25 Frame structure defined in the IEEE 802.11a standard. In the time domain, the 
waveforms f, through ¢,,9 are identical (ten repetitions of short training) and 7; and T, are the same 
(two repetitions of long training). The Physical Layer Convergence Procedure (PLCP) is used to 
denote the extra training symbols inserted to aid synchronization and training. 


The CEF of IEEE 802.11a consists of two repeated OFDM symbols, each containing training data, 
preceded by an extra-long cyclic prefix. Consider a training sequence of length NV = 64 with values 


ie 0 nT a (5.211) 


{t[n] _s = {1,—-1,—1,1,—-1,1,—-1,1,1,1,1,1} (5.212) 

Be ge ll ed Dh ig SR Dy ig DE hl (5.213) 
n=3 

tin] ho sg = 41,1, =1,—1,1,1,=1, 1, =1,1,1,1,1 (5.214) 
: 50 — j 


where all other subcarriers are zero. IEEE 802.11a uses zero subcarriers to help with spectral 
shaping; a maximum of only 52 subcarriers is used. The n = 0 subcarrier corresponds to DC and is 
also zero. The CEF is constructed as 


forn=0,1,..., 159. Essentially the CEF has one extended cyclic prefix of length ZL, = 32 (extended 
because it is longer than the normal L, = 16 prefix used in IEEE 802.11a) followed by the repetition 
of two IDFTs of the training data. The repetition is also useful for frequency offset estimation and 
frame synchronization described in Section 5.4.4. Time-domain channel estimation can proceed 
directly using (5.215). For estimation ae the frequency domain, we define two slightly different 
truncated received signals ‘iln] = yln + 32) and ya{r] = yln + 96] forn=0,1,..., 63. Taking 
the DFT gives {yi [hi =0 and {y2[h' b= “- 0. The parts of the received signal senespedlinn to the 
nonzero training LY! [A] Fee i {yi [A] Re =38, {y2|h beat, and {¥2{k] }f=3s can then be used for channel 
estimation by using a linear equation of the form 

h + a (5.216) 

V2 


y\ 
Yo 


5.3.3 Direct Least Squares Equalizer 


In the previous sections, we developed channel estimators based on training data. These channel 
estimates can be used to devise equalizer coefficients. At each step in the process, both channel 
estimation and equalizer computation create a source of error since they involve solving least squares 
problems. As an alternative, we briefly review an approach where the equalizer coefficients are 
estimated directly from the training data. This approach avoids the errors that occur in two steps, 
giving somewhat more noise robustness, and does not require explicit channel estimation. 


PE 
PE 


and finding the least squares solution. 


To formulate this problem, again consider the received signal in (5.87), and the signal after 


equalization in (5.96), which gives an expression for s|n — Na], We now formulate a least squares 
problem based on the presence of the known training data s[n] = ¢[n] forn =0, 1,..., N,,. Given the 


location of this data, and suitably time shifting the observed sequence, gives 


Ls 
tn = ». Fina (Cly|n + Ng — e| (5.217) 


&=0 


forn=0,1,...,N,,- Building a linear equation: 


t{0] y|nq| iil y(n — L| Ina 0) 
t{1 o “hy frall 
| _ y[na + 1] i (5.218) 
t [Mer = 1] y|na |. Ni ra 1) ata y|nq ++ Ner — Ls] Tors [Li] 
—_—_—_—_” —_—_—_—_—_——_— 
t Vu fg 


d 


In matrix form, the objective is to solve the following linear system for the unknown equalizer 
coefficients: 


oe a ee (5.219) 


IfLp+ 1 >N,, then Y,,, is a fat matrix, and the system of equations is undetermined. This means that 
there is an infinite number of exact solutions. This, however, tends to lead to overfitting, where the 
equalizer is perfectly matched to the observations in Y,, . A more robust approach is to take Ly < M, — 
I, turning Y,, , into a tall matrix. It is reasonable to assume that Y,, x is full rank because of the 


presence of additive noise in y[n]. Under this assumption, the least squares solution is 


f,,=(Y¥*.Yn,) Yt (5.220) 


Ned Na * 


* * 5 ; =J * 
The squared error is measured as Ji[na] = t*t — t*Yn, (Yn. Yna) Ynat. The squared error 


can be minimized further by choosing ng such that J,[1g] 1s minimized. 


A block diagram including direct equalization is illustrated in Figure 5.26. The block diagram 
assumes that the equalization computation block optimizes over the delay and outputs the 
corresponding filter and required delay. Direct equalization avoids the separate channel estimation 


and equalizer computation blocks. 
Symbol : 


st)-+|C/D g, [a] 
Figure 5.26 A complex pulse-amplitude modulation receiver with direct equalizer estimation and 
linear equalization 


The direct and indirect equalization techniques have different design trade-offs. With the direct 


approach, the length of the training determines the maximum length of the equalizer. This is a major 
difference between the direct and indirect methods. With the indirect method, an equalizer of any 
order Ly can be designed. The direct method, however, avoids the error propagation where the 
estimated channel is used to compute the estimated equalizer. Note that with a small amount of 
training the indirect method may perform better since a larger L> can be chosen, whereas a direct 
method may be more efficient when N,, is larger. The direct method also has some robustness as it 
will work even when there is model mismatch, that is, when the LTI model is not quite valid or there 
is interference. Direct equalization can be done perfectly in multichannel systems as discussed further 
in Chapter 6. 


5.4 Carrier Frequency Offset Correction in Frequency-Selective Channels 


In this section, we describe the carrier frequency offset impairment for frequency-selective channels. 
We present a discrete-time received signal model that includes frequency offset. Then we examine 
several carrier frequency offset estimation algorithms. We also remark how each facilitates frame 
synchronization. 


5.4.1 Model for Frequency Offset in Frequency-Selective Channels 


In Section 5.1.6, we introduced the carrier frequency offset problem. In brief, carrier frequency offset 
occurs when the carrier used for upconversion and the carrier used for downconversion are different. 
Even a small difference can create a significant distortion in the received signal. 

We have all the ingredients to develop a signal model for frequency offset in frequency-selective 
channels. Our starting point is to recall (5.67), which essentially says that carrier frequency offset 
multiplies the matched filtered received signal by e?"/c’, Sampling at the symbol rate, and using our 
FIR model for the received signal in (5.87), we obtain 


iB 
ee sin — £] + v[nl]. (5.221) 
(= 


It is possible to further generalize (5.221) to include frame synchronization by including a delay of d. 
Correction of carrier frequency offset therefore amounts to estimating ¢ and then derotating the 


received signals e ?™€"y[n]. 


In the frequency-flat case, the frequency offset creates a successive rotation of each symbol by 
exp(j27¢n). In the frequency-selective case, this rotation is applied to the convolutive mixture 
between the symbols and the channel. As a result, the carrier frequency offset distorts the data that 
would otherwise be used for channel estimation and equalization. Even if training data is available, it 
does not lead directly to a simple estimator. A direct extension of what we have studied leads to a 
joint carrier frequency offset and channel estimation problem, which has high complexity. In this 
section, we review several lower-complexity methods for frequency offset estimation, which rely on 
special signal structure to implement. 


5.4.2 Revisiting Single-Frequency Estimation 

For our first estimator, we revisit the estimator proposed in Example 5.8 and show how it can also be 

used in frequency-selective channels with minimal changes. This type of sinusoidal training was used 

in the GSM system through a special input to the GMSK (Gaussian minimum shift keying) modulator. 
Let us choose as a training sequence ¢[n] = exp(j2af,n) forn =0,1,..., N,,— 1 where f, is a 

discrete-time design frequency. Consider the received signal (5.87) with s[n] =¢[n] forn =0,1,..., 

N,- — 1. Discarding samples that do not depend on the training, we have the received signal 


L 
y[n] =e?" S © hfdt[n — + v[n] (5.222) 
é=0 
forn=L,L+1,...,N,— 1. Substituting for the training signal, 
L . F 4 
yln] = emer 5 h[gePF-9 + vin) (5.223) 
£=0 
L ~ 
= eiamenelankn  " hidje re? + vin] (5.224) 
é=0 
= ¢i2men jer finp, (27h )+v[n] (5.225) 


where h (e?%/t) is the DTFT of h[n]. Derotating by the known frequency of the training signal, 


eeu ht y(n] — ei2menp, (e274) rs e i2tht v[n]. (5.226) 


This has the form of (5.72) from Example 5.8. As a result, the estimators for frequency-flat frequency 
offset estimation using sinusoidal training can also be applied to the frequency-selective case. 

The main signal processing trick in deriving this estimator was to recognize that signals of the form 
exp(j27f,n) are eigenfunctions of discrete-time LTI systems. Because the unknown channel is FIR with 
order L, and the training N,, > Z, we can discard some samples to remove the edge effects. The 
frequency f, could be selected of the form k/N,, to take advantage, for example, of subsequent 
frequency-domain processing in the modem. 

The main advantage of single-frequency estimation is that any number of good algorithms can be 


this approach is that performance is limited by h (e"/t). Since the channel is frequency selective by 
assumption and has L zeros because it is FIR of order L, it could happen that the chosen frequency of 
f, 1s close to a zero of the channel. Then the SNR for the estimator would be poor. The sinusoidal 
training also does not lead to a sharp correlation peak that can be used for frame detection. 
Furthermore, if used to construct T in (5.175), it would in fact not have full rank, and thus it could not 
be used for channel estimation. As a result, we now review other approaches for carrier frequency 
offset estimation. 


5.4.3 Frequency Offset Estimation and Frame Synchronization Using Periodic 
Training for Single-Carrier Systems 


There are several different algorithms for frequency offset estimation using different properties of the 
transmitted signal such as periodicity, constant modulus, and so on. One of the most elegant methods 
was proposed by Moose [231] and has since been studied extensively. This method relies ona 
special periodic training sequence that permits joint carrier frequency offset estimation and frame 
synchronization. The training sequences can also be used for channel estimation. Periodic training has 
found application in IEEE 802.11a/g/n/ac systems and others. In this section, we consider the 
application of periodic training to a single-carrier system, though note that Moose’s original 
application was to OFDM. We deal with the case of OFDM, including another algorithm called the 
Schmidl-Cox approach [296], in the next section. 


Now we explain the key idea behind the Moose algorithm from the perspective of using just two 
repeated training sequences. Other extensions are possible with multiple repetitions. Consider the 
framing structure illustrated in Figure 5.27. In this frame, a pair of training sequences are sent 
together, followed by a data sequence. 


Frame 
eco | eee 
——_——™ —71.—--—————_ 
Ne Ne Nwr2Ny data symbols 
Training 
Figure 5.27 The framing used in the Moose estimator. A pair of training sequences are followed by 


data. 


The Moose algorithm exploits the periodicity in the training sequence. Let the training sequence 
start at n = 0. Then 


s[n] = s[n + Mr] = t[n] (5.227) 
forn=0,1,...,N,,— 1. Note that symbols s[”] for n < 0 and n = 2N,, are unknown (they are either 


zero or correspond to the unknown portions of the data). 
Now we explain the trick associated with periodicity in the training data. For n © [Z, N,,— 1], 


L 
y(n] = len =. hlé|s[n — | + v[n] (5.228) 
€=0 
L 
yl + Ne] = el2reln ter) 2 hléjs[n + Mer — U] + v[n + Mer]. (5.229) 
£=0 


Using the fact that s[n + N,,] =s[n] =¢[n] forn=0,1,...,N,—1: 


L 
y[n + Nex] = el27eNtr pj2men 2. hléjt{n — €] + v[n + Mer] (5.230) 
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py l2teNer yin], (5.231) 
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To see the significance of this result, remember that the channel coefficients {h{4] }e—0 are unknown! 

The beauty of (5.231) is that it is a function of known observations and only the unknown frequency 

offset. Essentially, the periodicity establishes a relationship between different parts of the received 

signal y[n]. This is an amazing observation, since it does not require any assumptions about the 

channel. 


There are several possible approaches for solving (5.231). Note that this is slightly different from 
the single-frequency estimator because of the presence of y[] on the right-hand side of (5.231). The 
direct least squares approach is not possible since the unknown parameter is present in the exponent 
of the exponential. A solution is to consider a relaxed problem 


y[n + Nir] =ay(n], (5.232) 


solving for Gs and then finding ¢ from the phase. This problem is similar to flat-fading channel 
estimation in Section 5.1.4. Using (5.47), we can write 


Ner—l = 
é ! ney (nly[n + Nex 
éLs = phase feo + Nel (: 


2a Nex lta y*|n|y(n| 
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233) 


Nex —] 
1 ; ; 
= Na phase ( iS y*(njy[n + rs) (5.234) 


n=L 


where we can neglect the denominator since it does not contribute to the phase estimate. Despite the 
relaxation in this derivation, it turns out that this is also the maximum likelihood estimator [231]. 


There is an important limitation in the Moose algorithm. Because of the periodicity of discrete-time 


exponentials, the estimate of ¢ is accurate only for eNtr| S 3 or equivalently 
le| < - : ; (5.235) 
= ON, | 
which in terms of the actual frequency offset means 
lfel < —_ (5.236) 
2T Nir | 


This reveals an interesting trade-off in the estimator performance. Choosing larger N,. improves the 


estimate since there is more noise averaging, but it reduces the range of offsets that can be corrected. 
A way of solving this problem is to use multiple repetitions of a short training sequence. IEEE 
802.1la and related standards use a combination of both repeated short training sequences and long 
training sequences. 


Example 5.19 
Compute the maximum allowable offset for a 1Ms/s-QAM signal, with f, = 2GHz and N, = 10. 


Answer: 

ld 1 e oer" 
max| f= oTN (5.237) 

ae 4 tr 

a ee | 
= -10°— (5.238) 

2 10 
=5 10° = 50kHz. (5.239) 


In terms of parts per million, we need an oscillator that can generate a 2GHz carrier with an accuracy 
of 50e3/2e9 = 25ppm. 


Example 5.20 


Consider a wireless system where each data frame is preceded by two training blocks, each 
consisting of N,, = 12 training symbols. Let the symbol period be T= 4us. What is the maximum 


frequency offset that can be corrected using training? 


Answer: The maximum frequency offset that can be corrected using training is ¢ = 1/(2N,,) ~ 
0.0417 or f, = 1/(2TN,,) ~ 10.41 7kHz. 


The Moose algorithm also provides a nice way of performing frame synchronization. The 
observation is that the correlation peak should occur when the pair of training sequences is 
encountered at the receiver. Essentially, this involves solving for the offset d such that 


pana yin + A+ Merly*(n + Al) 


(SX lyin + Al?) 


where the search is performed over a reasonable set of possible values of d. For example, the analog 
RF may perform carrier sensing, activating the ADCs only when a sufficiently strong signal is 
present. 


~ 


d = arg, max 


(5.240) 


The denominator normalization in the frame synchronization is required to avoid false positives 
when there is no signal present. An alternative solution, which is somewhat more robust in this case, 
involves normalizing by both observed vectors as in 


2 
Sony yn + A+ Nely*[n + A] 
(ORT twin + ANP?) (OAT yin + A + Nell?) 


d = arg, Max 


No matter which algorithm is used, both result in the same thing: a joint solution to the frame 
synchronization and frequency offset estimation and correction problem in an intersymbol interference 
channel. 


Channel estimation is also facilitated using the Moose algorithm. Once the frequency offset is 
estimated and corrected, and the frame is synchronized, the pair of training sequences can be 
combined for channel estimation. As a result, periodic training provides a flexible approach for 
solving key receiver functions in frequency-selective channels. 

An illustration of the complete receiver can be found in Figure 5.28. The frequency offset 
estimation and correction are performed prior to channel estimation and equalization but after the 
downsampling operation. Better performance could be achieved by operating before the 
downsampling, replacing the symbol synchronizer, but this is usually practical only for smaller values 


of M,.. 
" Symbol - 
z" Ly Sc 2 " 


4 


r(t)—>|C/DF 1g, [7] 


— Symbol ; ged | Equalizer 
M. sync offset computation 
estimation 


Channel 
a 
estimator 


Figure 5.28 A single-carrier receiver with frequency offset estimation and correction, frame 
synchronization, channel estimation, and linear equalization. The linear equalization could be 
replaced by an SC-FDE receiver structure with no other changes required. 


We conclude this section with a detailed example that describes how to use the preamble structure 
in IEEE 802.15.3c single-carrier mode and in IEEE 802. 11a. 


Example 5.21 

Consider the IEEE 802.15.3c preamble structure as described in Example 5.17 for the high-rate mode 
of the SC-PHY. In this example, we describe how the SYNC field is used for carrier frequency offset 
estimation and frame synchronization. Note that the standard just describes the preamble itself; it does 
not describe how the receiver signal processing should be applied to the preamble. A good 
description of digital synchronization algorithms for IEEE 802.15.3c is found in [204]. 

The SYNC field can be used for frame synchronization and carrier frequency offset estimation. The 
SYNC field contains 14 repetitions of a;52. Because the maximum supported channel length is L, = 
128, though, the first repetition acts as a cyclic prefix. Therefore, we can use the following frame 
detection algorithm: 

2 2 
ws 13 eee y[n + A + 128p + 128] y*[n + A + 128p] | | 
d = arg max > nh a Sa (5.242) 

A p=! tae ly[n + A+ 128+ 128p]|”) 


where we have put the packet energy in the denominator to make the algorithm more robust when 
averaging over multiple repetitions. The carrier frequency offset can then be estimated from either 
averaging the offset with each period and combining or taking the phase of the average as 


13 127 
- 1 . ~ : ‘ ~ 
€= ——— 2 phase (>: y[n + d+ 128p + 128]y*[n + d+ 1) (5.243) 
ai aC p 


n=O0 


The maximum absolute offset correctable is 1/256. 


In packet-based wireless systems, the digital part of the receiver may be in sleep mode to save 
power. As a result, the analog may make an initial determination that there is a signal of interest, for 
example, if the signal exceeds a threshold. The SYNC field can be used for this initial determination. 
In the process, though, some samples may be lost, and not all repetitions will be available for 
averaging. As a result, it may make sense to average over fewer repetitions in (5.242). It also may 
make sense to either defer frequency offset correction or make a correction based on this tentative 
estimate (called a coarse correction), then proceed with further correction and refinement using the 
SFD field. 


Example 5.22 


Consider the IEEE 802.11la preamble structure as described in Example 5.18. In this example, we 
explain the coarse frequency offset using the STF field and the fine frequency offset using the CEF 
field. We assume B = 20MHz bandwidth. The STF field (in the time domain) consists of ten 
repetitions (created by zeroing subcarriers as described in Section 5.4.4). The CEF field has two 
repetitions. Sampled at T= 1/B, each repetition of the STF contains 16 samples, whereas the CEF has 
two repetitions of 64 samples with a length 32 cyclic prefix. 


The STF field is used for functions such as packet detection, automatic gain control (AGC), and 
coarse synchronization. Because the RF may be powering up, and the AGC adjusting, not all of the ten 
repetitions are typically used for synchronization. For example, suppose that three repetitions are 
used with the first acting as a cyclic prefix. Then we use the following frame detection algorithm: 


aa aan y|n +A+ 16]y*(n is Al ‘ 
dcoarse = arg max -— A (5.244) 


A pa, ly[n + A+ 16)/?) 


The coarse carrier frequency offset estimate is then 


15 
és 1 ~ - 7 
j —_—= —-—— ase . — . ) mar 5 22 5 
Ecoarse 5716 nase (> y[n + deoarse + 16]y*[n + donna). (5.245) 
n=0 
The maximum absolute frequency offset correctable is 1/32. 
Next, we correct for the coarse frequency offset to create the new signal 


yin] = exp(—j27€coarse”)y|n| Using this corrected signal, we then search for the long training 
sequence: 


ae at og[n+ A + 64]y*[n + A] 


dae = arg max (5.246) 


. a ly|n Ah ae 32])”) 


The coarse frame synchronization estimate can be used to reduce the search space of possible offsets. 
The fine carrier frequency offset estimate is then 


63 
ss 1 ~— AE a’ Set 3 Re -_ 
€fne = 5764 Phase ( ) y[n + dane + 64] y*[n + dal). (5.247) 


n=0 


The two-step approach allows an initial frequency offset estimate with a larger correction range, 
but it is noisier in the first step because of the use of only 16 samples. Then, in the second step, it is 
possible to get a more accurate estimate by using 64 samples with a smaller range, assuming the first 
phase corrected for the larger errors. 


5.4.4 Frequency Offset Estimation and Frame Synchronization Using Periodic 
Training for OFDM Systems 


OFDM 1s sensitive to carrier frequency offset. The reason is essentially that carrier frequency offset 
over the duration of an OFDM symbol creates problems because the DFT is taken over the whole 
OFDM symbol period. Small variations are compounded by the DFT operation into bigger variations. 


Frequency offset correction algorithms for OFDM systems are similar to their time-domain 
counterparts. The method in Section 5.4.2 can be applied to OFDM by sending a OFDM symbol with 
just a single subcarrier active. The method in Section 5.4.3 can be applied directly only to multiple 
repetitions of OFDM systems. In this section, we develop another carrier frequency offset estimator 
that works on the periodicity created in one OFDM symbol. It also has an advantage over the Moose 
method in that it allows a larger range of offsets to be corrected, when used with a second specially 
designed OFDM symbol. 


We start by explaining how to create periodicity in a single OFDM symbol. The key insight is to 
make a portion of the transmitted signal look periodic and then to apply the same technique as before. 
Suppose that we construct an OFDM symbol where all the odd subcarriers are zero. Then 


w[n] = — S~ s[mJe (5.248) 


+ 2am nm sc.) 
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This looks like an extended discrete-time sinusoid with carrier 2n(V/2). As a result, forn © [L,, L, 
+ N/2], 


which means that the OFDM signal contains a portion that is periodic. As a result, the Moose 
algorithm can be applied directly to this setting. 


N 
2 


wl[n| = w | + (5.250) 


Consider the case where frame synchronization has already been performed. The received signal 
model is 


EL 
y[n] = eer Dy hl{é}w[n — €) + v[n). (5.251) 
£=0 
Discarding the cyclic prefix, 
y[n|] = y[n + L,]. (5.252) 
Because of the periodicity, 


g[n + N/2] = &?*N/2 GI), (5.253) 


Then the Moose frequency offset estimator is 


N/2-1 


iJ 
——— phase y|njyin+N|]. (5.254) 
sect (Se view 
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As before, it is possible to perform OFDM symbol synchronization (or frame synchronization) and 
frequency offset estimation jointly using this correlation method. The OFDM training symbol can even 
be used for channel estimation as this is just a special case of the comb-type pilot arrangement 
described in Section 5.3.2. 


The maximum correctable carrier frequency offset 1s 


1 


ae 
lel S 2N/2 
1 


a eae 5.256) 

N ( ) )) 
As a result, the Moose algorithm can essentially correct for continuous-time offsets that are less than 
one subcarrier spacing 1/(N7). Inan OFDM system, N may be quite large. This would in turn reduce 


the range of correctable offsets. Thus, it is of interest to develop techniques to enhance the range. One 
such approach is the Schmidl-Cox algorithm [296]. 


Consider an OFDM system with two training OFDM symbols. In the first symbol, all the odd 


N/2-1 s 
subcarriers are zero. The even subcarriers {41|2"|}n=0 are 4-QAM training symbols, which are 
nominally just a pseudo-noise sequence. The presence of zeros ensures that there is periodicity in the 


N-1 
first OFDM symbol. Let {t2["|}n=0 bea set of 4-QAM training symbols sent on the second OFDM 
symbol, without zeros. Further suppose that the even training symbols are selected such that 


Ba 


7 - 1. N/2-1. ae 
to[2n]tt[2n] = ts[2n] where {t3!"]}n0 — is another sequence with good periodic correlation 
properties. Effectively, the training data on even subcarriers of the second OFDM symbol is 
differentially encoded. 


The Schmidl-Cox algorithm works as follows. Let the frequency offset be written as 


2q 
— NV + €frac (5.25 r) 


where q is called the integer offset and ¢,,.,, 1s the fractional offset. Because 


el2meN/2 _ ei2m( 44 +€frac ) (5.258) 
— gi2n( WF +etrac) F (5.259) 
— gli2t tracy (5.260) 


the expression (5.253) is able to account for only fractional frequency offsets. Suppose that the 
estimator in (5.254) is used to estimate the fractional frequency offset. Further suppose that the 
estimate is perfect and is then removed by multiplying by exp(—j27¢5,,,V/2), leaving the received 


signal after correction as 
y(n] = citnin 5 hi win — é] + v[nl]. (5.261) 
£=0 


Discarding the cyclic prefix and taking the DFT gives 
y[k] = e227 _WLehl((k — 2q)) w]s[((k — 2¢))n] + Ok] (5.262) 


fork =0,1,...,N— 1. The received signals corresponding to each training symbol use the index k = 
Oc tesco A 


yi [hk] = e22* 8 Heh [((k — 2g) w]ta[((k — 2g) w] + OK] (5.263) 
yo[k] = e732" ¥Leni((k — 2q)) n]to[((k — 2¢))w] + DIK +N + Le]. (5.264) 
Now we exploit the fact that the training on the even subcarriers was differentially encoded. Consider 
yo[2k]y$ [2h] = |h[((2k — 2q)) w]|?t2[((2k — 24) w}¢t[((2k — 2q))v] + H'[k] (5.265) 
= |h{((k — 2q)),v]|?tg{((2k — 2q))n] + 0A (5.266) 
where ©’ |‘ contains the products with the noise terms. Because {t3[2n oie has good periodic 


correlation properties, a least-squares-type problem can be formulated and used to find the shift with 
a correlation peak: 


2 
peg ' yo[2k + Qplyt [2k + ple} [2k + 2p] 
q= ae ee ee (5.267) 
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Based on this integer offset estimate, the received samples can be corrected by e~/?* W "Then the 
second OFDM training symbol can be used to estimate the channel (possibly combined with the first 
OFDM symbol as well). 


The final frequency offset estimate obtained with the Schmidl-Cox algorithm is 


G 

Em =# Bee (5.268) 
When both fractional and integer offset corrections are applied, a large range of offsets can be 
corrected. While it seems like very large offsets can be corrected, recall that the system model was 
derived assuming a small offset. A large offset would shift part of the desired signal outside the 
baseband lowpass filter, rendering the signal models inaccurate. In practice, this approach should be 
able to correct for offsets corresponding to several subcarriers, depending on the analog filtering, 
digital filtering, and extent over oversampling. 


Like the Moose method, the Schmidl-Cox approach can also be used for OFDM symbol 
synchronization, by looking for a correlation peak around the first OFDM symbol as 


r fey = i 2 
SON2 gin + A + N/2}g"In + A 


d= arg max . (5.269) 


2 
i N {/2—1 — / Tife > 2 
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There is one distinct point of difference, though. With OFDM, there is also a cyclic prefix and often 
L, > L. In this case, then 


y[N + Le +n] x &?™"y[n} (5.270) 


forn=L,L+1,...,L,,L,+1,...,N+L£,—1. This means that there will be some ambiguity in 
(5.269), in that there may be several values of d that are close, especially when L, is much larger than 


L. This creates a plateau in the symbol synchronization algorithm [296]. Any value within the plataeu, 
however, should give acceptable performance if the estimated channel has order L,, but if the smaller 
channel length of Z is assumed, then performance may suffer. Other training sequence designs can 
improve the sharpness of the frame synchronization, specifically with multiple repetitions along with 
a sign change among the different repetitions [306]. 


A system block diagram with OFDM and frame synchronization, channel estimation, and carrier 
frequency offset correction 1s illustrated in Figure 5.29. One frequency offset correction block is 
shown, but this = be broken up ra two pieces | lace to integer and fine offsets. 
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Figure 5.29 OFDM receiver frequency offset estimation and correction, channel estimation, and 
linear equalization. The matched filtering and symbol sampling are omitted as they are often not 
performed in OFDM. 


Example 5.23 


In this example, we show how the STF in IEEE 802.11a is constructed to have periodicity. The STF 
is generated froma special training sequence with 12 nonzero values constructed as 


{t[4k]}i2.9 = {0, -1 —j, -1 —j,14+j,1+j,14+j,14), 
0,0,0, 0,14 3;—1—j1:+j,-1 —j, 1 +9} (5.271) 


for k=0,1,..., 15 and the other undefined values of the training are zero. 


The presence of zeros in (5.271) is a result of the need for eliminating the DC subcarrier and 
spectral shaping, similar to the design of the CEF. Based on this training sequence, 
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forn=0,1,..., 159. The time-domain waveform thus contains ten repetitions of length 16. 


5.5 Introduction to Wireless Propagation 


Of all the impairments, the propagation channel has the most impact on the design of a wireless 
receiver. The wireless channel causes the transmitted signal to lose power as it propagates from the 
transmitter to the receiver. Reflections, diffraction, and scattering create multiple propagation paths 
between the transmitter and the receiver, each with a different delay. The net result is that wireless 
propagation leads to a loss of received signal power as well as the presence of multipath, which 
creates frequency selectivity in the channel. In this section, we provide an introduction to the key 
mechanisms of propagation. Then we rationalize the need for developing models and explain the 
distinction between large-scale models (discussed further in Section 5.6) and small-scale models 
(discussed further in Section 5.7 and Section 5.8). In this section, we briefly describe important 
factors affecting propagation in wireless channels to explain the need for several different channel 
models. The interested reader is referred to [270] or [165] for more extensive treatment of wireless 
propagation. 


5.5.1 Mechanisms of Propagation 


In a wireless communication system, a transmitted signal can reach the receiver via a number of 
propagation mechanisms. In this section, we review at a high level these key mechanisms, each 
potentially associated with a different propagation path. These mechanisms are illustrated in Figure 
5.30. 


ACCESS POINT 


CLIENT 1 


Figure 5.30 The mechanisms of propagation in the context of an indoor wireless local area 
network. The LOS path is unobstructed between the access point and client 1. That client also 
receives weaker signals as a result of a reflection off of a wall. The LOS path is obstructed for 
client 2, who instead receives signals through diffraction in the doorway and also scattering off of 
a rough wall. 


When a signal reaches the receiver from the transmitter in a single path, without suffering any 
reflections, diffractions, or scattering, this is known as propagation along the /ine-of-sight (LOS) 
path. An LOS component has the shortest time delay among all the received signals and is usually the 
strongest signal received. The exact classification of a path being LOS requires that any obstructions 
be sufficiently far away from the path, which is quantified by the idea of the Fresnel zone [270]. 


In non-line-of-sight (NLOS) propagation, a signal transmitted into a wireless medium reaches the 
receiver via one or more indirect paths, each having different attenuations and delays. When a 


transmitted signal travels through communication paths other than the LOS path to reach the receiver, 
itis said to have undergone NLOS propagation. NLOS propagation is responsible for coverage 
behind buildings and other obstructions. The main NLOS propagation mechanisms are reflection, 
scattering, and diffraction. 


Reflection occurs when a wave impinges on an object that is smooth, which means that any 
protrusions have dimensions much larger than a wavelength. Reflection is accompanied by refraction 
(transmission of the wave through the object). The strengths of the reflected and refracted waves 
depend on the type of material. The angles and indices of reflection and refraction are given by 
Snell’s law. 


Scattering is what happens when a wave impinges on an object that is rough or has irregularities 
with dimensions on the order of the wavelength. It is similar to reflection but results in a smearing of 
the signal around the angle of reflection. This leads to a larger loss of energy as the signal is spread 
over a wider area. It also results in multiple paths arriving at the receiver from a similar location 
with slight differences in delay. 


Diffraction is the “bending” of waves around sharp corners. Important examples of diffraction 
include waves bending over the tops of buildings, around street corners, and through doorways. 
Diffraction is one of the main ways that it is possible to provide cellular coverage in cities and is one 
reason why lower frequencies, say less than 3GHz, are considered beachfront property in the world 
of cellular spectrum. 


There are other mechanisms of propagation as well, such as tropospheric or ionospheric scattering, 
but these are not common in land mobile systems. They do have relevance, though, for battlefield 
networks and for amateur radio enthusiasts. 


5.5.2 Propagation Modeling 


Propagation has an impact on several aspects of radio link performance. It determines the received 
signal strength and thus the signal-to-noise ratio, throughput, and probability of error. Propagation 
also plays a major role in system design and implementation. For example, it determines the length of 
the discrete-time channel, which determines how much equalization is required and in turn how much 
training is required. The speed at which the channel varies determines the frequency of equalization 
and the frequency of training—in other words, how often the channel must be reestimated. 


An important component of the study of wireless communication is propagation modeling. A 
propagation model is a mathematical model (typically stochastic) to characterize either the 
propagation channel or some function of the propagation channel. Some models try to model the 
impulse response of the channel, whereas others try to model specific characteristics of the channel 
like the received power. Propagation models are usually inspired by measurement campaigns. Some 
models have many parameters and are designed to model specified real-world propagation scenarios. 
Other models have few parameters and are more amenable for tractable mathematical analysis. 


There are many ways to classify propagation models. A common first-order classification 1s 
whether they describe large-scale or small-scale phenomena. The term scale refers to a wavelength. 
Large-scale phenomena refer to propagation characteristics over hundreds of wavelengths. Small- 
scale phenomena refer to propagation characteristics in an area on the order of a wavelength. 


To illustrate different large-scale and small-scale phenomena, consider the average received signal 
power as a function of distance in Figure 5.31. Three different received signal power realizations are 


plotted. The first model is the mean (or median) path loss, which characterizes the average signal 
behavior, where the average is taken over hundreds of wavelengths. In most models, this is an 
exponential decay with distance. The second model is path loss with shadow fading. Here large 
obstructions like buildings or foliage are included to provide variability around the mean on a large 
scale. Shadow fading is often modeled by adding a Gaussian random component (in decibels) 
parameterized by the standard deviation to the mean path loss. The third model also includes small- 
scale fading, where the signal level experiences many small fluctuations as a result of the constructive 
and destructive addition of the multipath components of a transmitted signal. Because the fluctuations 
happen over a much smaller spatial scale, it is common to develop separate models for large-scale 
effects (like mean path loss and shadowing) and small-scale effects (like multipath fading). 
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Figure 5.31 Representation of the large-scale (distance-dependent path loss and shadowing) and 
small-scale (fading) propagation effects 


In essence, large-scale fading models describe the average behavior of the channel in a small area 
and are used to infer channel behavior over longer distances. Small-scale fading models describe the 
localized fluctuations in a given area and may be location dependent. 


Models for both large-scale and small-scale propagation phenomena are important. Large-scale 
trends influence system planning, the link budget, and network capacity predictions, and they capture 
the “typical” loss in received signal strength as a function of distance. Small-scale trends influence 
physical-layer link design, modulation schemes, and equalization strategies by capturing local 
constructive and destructive multipath effects. The received signal processing algorithms depend 
more strongly on small-scale models, but the net performance of those algorithms in a system depends 
on the large-scale models as well. 


Propagation models are widely used for wireless system design, evaluation, and algorithm 
comparison. Standards bodies (IEEE 802.11, 3GPP, etc.) find them useful for enabling different 
companies to compare and contrast performance of different candidate schemes. Often these models 
are chosen to suit certain characteristics like propagation environment (urban, rural, suburban) or 
receiver speed (fixed, pedestrian, or high speed). The models typically have many possible 
parameter choices and are used primarily as part of system simulation. 


In this section, we focus on developing models for the discrete-time equivalent channel. We 
decompose the channel taps as a product of a large-scale coefficient and a small-scale coefficient as 


h[é] = VGh, i ae. L. (5.275) 


The large-scale gain is G = E,/P,, i,,(d) where P,, j,(@) is the distance-dependent path-loss term in 
linear (to distinguish it from the more common decibel measure P,,(d)). The small-scale fading 
coefficient is denoted by /,[ 0]. The path loss is the ratio of the transmit power to the receive power 


and is often called the path gain for this reason. With complex pulse-amplitude modulation, for 
example, the transmit power is £,/T7, so the received power is (£,/7)/P,. j,(@). In Section 5.6, we 
describe models for P,,(d), including free space, log distance, and shadowing. In Section 5.8, we 

IIL 
describe models for t/’s|¢) be=0, some that are good for analysis like the IID Rayleigh fading model, 
and others that are inspired by physical mechanisms of propagation like the clustered model. 


5.6 Large-Scale Channel Models 

In this section, we review several large-scale models for signal strength path loss as a function of 
distance, denoted by P,(d). We review the Friis free-space equation as a starting point, followed by 
the log-distance model and the LOS/NLOS path-loss model. We conclude with some performance 
calculations using path-loss models. 


5.6.1 Friis Free-Space Model 


For many propagation prediction models, the starting point has been the Friis free-space model [165, 
270]. This model is most appropriate for situations where there are no obstructions present in the 
propagation environment, like satellite communication links or in millimeter-wave LOS links. The 
Friis free-space equation is given by 


a (d) _ Prx.1inGt,linGr,linX” 
rx,lin\ (4)2d? 


(5.276) 


where the key quantities are summarized as follows: 
* Pix tin 18 the transmit power in linear. For complex pulse-amplitude modulation Pi, jj, = £,/T. 
¢ dis the distance between the transmitter and the receiver. It should be in the same units as A, 
normally meters. 
¢ i is the wavelength of the carrier, normally in meters. 
* Gijin and G,,;, are the transmit and receive antenna gains, normally assumed to be unity unless 
otherwise given. 
A loss factor may also be included in the denominator of (5.276) to account for cable loss, impedance 
mismatch, and other impairments. 

The Friis free-space equation implies that the isotropic path loss (assuming unity antenna gains 
Gyjin = Gtjin = 1) increases inversely with the wavelength squared, x. This fact implies that higher 
carrier frequencies, which have smaller 4, will have higher path loss. For a given physical antenna 
aperture, however, the maximum gains for a nonisotropic antenna generally scale as G;,, = 424,/A7, 
where the effective aperture A, is related to the physical size of the antenna and the antenna design 
[270]. Therefore, if the antenna aperture is fixed (the antenna area stays “the same size’’), then path 


loss can actually be lower at higher carrier frequencies. Compensating for path loss in this manner, 
however, requires directional transmission with high-dimensional antenna arrays and is most feasible 
at higher frequencies like millimeter wave [268]. 


Most path-loss equations are written in decibels because of the small values involved. Converting 
the Friis free-space equation into decibels by taking the log of both sides and multiplying by 10 gives 
Pix(d) = Pix + Gt + G, + 20 logy 9(A) — 20 log, (47) — 20 log;)(d) (5.277) 


where P.,(d) and P,, are measured in decibels referenced to 1 watt (dBW) or decibels referenced to 
1 milliwatt (dBm), and G, and G, are in decibels. All other path-loss equations are given directly in 
terms of decibels. 

The path loss is the ratio of P,jin(Z) = Pex tin! Prxjin(@), or in decibels P,, — P,,(d). Path loss is 


essentially the inverse of the path gain between the transmitter and the receiver. Path loss is used 
because the inverse of a small gain becomes a large number. The path loss for the Friis free-space 
equation in decibels is 


P,(d) = 20 log;9(d) + 20 log;9(47) — Gy — G, — 20 log;9(A). (5.278) 


Example 5.24 
Calculate the free-space path loss at 100m assuming G, = G, = OdB and A = 0.01m. 


Answer: 
F180} = 20 log;,9(100) + 20 logi9(47) G; — G,; — 20 log;9(0.01) (5.279) 
= 102dB. (5.280) 


To build intuition, it is instructive to explore what happens as one parameter changes while keeping 
the other parameters fixed. For example, observe the following (keeping in mind the caveats about 
how antenna gains might also scale): 


¢ If the distance doubles, the path loss increases by 6dB. 
¢ If the wavelength doubles, the received power increases by 6dB. 


¢ If the frequency doubles (inversely proportional to the wavelength), the received power drops 
by 6dB. 


From a system perspective, it also makes sense to fix the maximum value of loss P,(d) and then to see 


how d behaves if other parameters are changed. For example, if the wavelength doubles, then the 
distance can also be doubled while maintaining the same loss. Equivalently, if the frequency doubles, 
then the distance decreases by half. This effect is also observed in more complicated real-world 
propagation settings. For example, with the same power and antenna configuration, a Wi-Fi system 
operating using 2.4GHz has a longer range than one operating at 5.2GHz. For similar reasons, 
spectrum in cellular systems below 1 GHz is considered more valuable than spectrum above 1GHz 
(though note that spectrum is very expensive in both cases). 


The loss predicted by the free-space equation is optimistic in many terrestrial wireless systems. 
The reason 1s that in terrestrial systems, ground reflections and other modes of propagation result in 


measured powers decreasing more aggressively as a function of distance. This means that the actual 
received signal power, on average, decays faster than predicted by the free-space equation. There are 
many other path-loss models inspired by analysis that solve this problem, including the two-ray 
model or ground bounce model [270], as well as models derived from experimental data like the 
Okumura [246] or Hata [141] models. 


5.6.2 Log-Distance Path-Loss Model 


The most common extension of the free-space model is the log-distance path-loss model. This classic 
model is based on extensive channel measurements. The log-distance path-loss model is 


P.(d) = a+ 108 log,,9(d) + (5.281) 


where a and f are the linear parameters and 7 is a random variable that corresponds to shadowing. 
The equation is valid for d > do, where dp is a reference distance. Often the free-space equation is 


used for values of d < dp. This results in a two-slope path-loss model, where there may be one 
distribution of n for d < dp and another one for d > do. 


The linear model parameters can be chosen to fit measurement data [95, 270]. It is common to 
select a based on the path loss derived from the free-space equation (5.278) at the reference distance 
dy. In this way, the a term incorporates the antenna gain and wavelength-dependent effects [312]. A 


reasonable choice for dy) = 1m; see, for example, [322]. The path-loss exponent is B. Values of B vary 


a great deal depending on the environment. Values of § < 2 can occur in urban areas or in hallways, in 
what is called the urban canyon effect. Values of 8 near 2 correspond to free-space conditions, for 
example, the LOS link in a millimeter-wave system. Values of B from 3 to as high as 5 are common in 
microcellular models [56, 282, 107], macrocellular models [141], and indoor WLAN models [96]. 


The linear portions of the log-distance path-loss model capture the average variation of signal 
strength as a function of distance and may be called the mean path loss. In measurements, though, there 
is substantial variation of the measured path loss around the mean path loss. For example, obstruction 
by buildings creates shadowing. To account for this effect, the random variable n is incorporated. It is 
common to select n as N(0,0 shod ) This leads to what is called log-normal shadowing, because n 1s 
added in the log domain; in the linear domain log;9(n) has a log-normal distribution. The parameter 
Oghaq WOuld be determined from measurement data and often has a value around 6 to 8dB. In more 


elaborate models, it can also be a function of distance. Under the assumption that n is Gaussian, P,(d) 


is also Gaussian with mean a + 10B log;o(d). For analytical purposes, is common to neglect 
shadowing and just focus on the mean path loss. 


Example 5.25 


Calculate the mean log-distance path loss at 100m assuming that 8 = 4. Compute a from the free-space 
equation assuming G, = G, = OdB, 4 = 0.01m, and reference distance dy = 1m. 


Answer: First, compute the reference distance by finding a = P,(1) from the free-space equation 
(3.278); 


P,(1) = 20 log, 9(1) + 20 log, (47) — G, — G; — 20log,,(0.01) (5.282) 
= 62dB. (5.283) 


Second, compute the mean path loss for the given parameters: 


P.(100) = 62dB + 10 - 4log,,(100) (5.284) 
= 62dB + 80dB (5.285) 
= 142dB. (5.286) 


Compared with Example 5.24, there is an additional 40dB of loss due to the higher path-loss 
exponent in this case. 


Example 5.26 
Consider the same setup as in Example 5.25. Suppose that the transmit power is P,, = 20dBm and that 
Oshad — SdB. 


¢ Determine the probability that the received power P,,(100) <—110dBm. 


Answer: The received signal power is 
P,,(100) = Pkx — P,(100) ( 
= Px — a — 108 log,,)(100) — 7 ( 
= 20dBm — 142dB — 7 (5.289) 
= —122dBm — 7. ( 


Now note that 
P[P,.x(100) < —110dBm] = P[—122dBm — n < —110dBm| (5.291) 
= P[—12dB < 7] (5.292) 


where the dBm disappears because dBm — dBm = dB (the 1mW cancels). Now let us rewrite 
this probability in terms of x ~ N (0, 1): 


—12dB 
P[P.x(100) < —110dBm] = P | = 2 7 (5.293) 
Oshad 
12dB 
=1-P| <2| (5.294) 
Oshad 
=p OS (5.295) 
~ 0.933. (5.296) 


This means that the received power is below —110dBm 93% of the time. 
¢ Determine the value of d such that P,,(d) <—110dBm 5% of the time. 


Answer: We need to find d such that P |Prx(d@) < —110dBm|} = 0.1 Following the same 
steps as in the previous part of the example, 


P[P,x(d) < —110dBm] = P [20dBm — 62dB — 40 log,9(d) — 7 < —110dBm] 


= P|[68dB — 40 log,,)(d) < 7] (5.298) 


_ | a8 — sPlosial@ < 7 (5.299) 


8dB 
40 log,,(d) — 68¢ | 
=1-p| Peet ee <a (5.300) 
oC 
, { 40 log,,9(d) — 68dB 
xia (ee ) (5.301) 
=O. (5.302) 


where we recognize that 68dB — 40 log,,(d) will be negative, so we can rearrange to get the 
final answer in the form of a Q-function. Using the inverse of the Q-function gives d = 27.8m. 


There are many variations of the log-distance path-loss model, with more refinements based on 
measurement data and accounting for other parameters. One common model is the COST-23 1 
extended Hata model for use between frequencies of 1S5OMHz and 2GHz. It contains corrections for 
urban, suburban, and rural environments [336]. The basic equation for path loss in decibels is 


P.(d) = 46.34+33.9 log, 9( fc) — 13.82 log, 9(ht)—b( hr) + (44.9—6.55 log, 9 (At )) log, 9(d)+C (5.303) 


where h, is the effective transmitter (base station) antenna height (30m < h, < 200m), h, is the receiver 
antenna height (Im </h, < 10m), d is the distance between the transmitter and the receiver in 
kilometers (d > 1km), b(/,) is the correction factor for the effective receiver antenna height, f, is the 


carrier frequency in megahertz, the parameter C = 0dB for suburban or open environments, and C = 
3dB for urban environments. The correction factor is given in decibels as 


b(hr) = (1.1log,o(fc) — 0.7) hr — (1.56 logyo(fe) — 0.8) (5.304) 


with other corrections possible for large cities. More realistic path-loss models like COST-231 are 
useful for simulations but are not as convenient for simple analytical calculations. 


5.6.3 LOS/NLOS Path-Loss Model 


The log-distance path-loss model accounts for signal blockage using the shadowing term. An 
alternative approach is to differentiate between blocked (NLOS) and unblocked (LOS) paths more 
explicitly. This allows each link to be modeled with potentially a smaller error. Such an approach is 
common in the analysis of millimeter-wave communication systems [20, 344, 13] and has also been 
used in the 3GPP standards [1, 2]. 

In the LOS/NLOS path-loss model, there is a distance-dependent probability that an arbitrary link 
of length d is LOS, which is given by P,,,(d). At that same distance, | — P;,,(d) denotes the 
probability that the link is NLOS. There are different functions for P,,,(d@), which depend on the 
environment. For example, 


Pros(d) = e787? (5.305) 


is used for suburban areas where C = 200m in [1]. For large distances, P;,,(d) quickly converges to 
zero. This matches the intuition that long links are more likely to be blocked. 

It is possible to find values of C for any area using concepts from random shape theory [20]. In this 
case, buildings are modeled as random objects that have independent shape, size, and orientation. For 
example, assuming the orientation of the buildings is uniformly distributed in space, it follows that 


mee a (5.306) 
AplagE [Poidg] | 


where Apigg is the average number of buildings in a unit area and E |Poiag] is the average perimeter of 


the buildings in the investigated region. This provides a quick way to approximate the parameters of 
the LOS probability function without performing extensive simulations and measurements. 
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Let °° (4) denote the path-loss function assuming the link is LOS, and let Pe'°(d) denote the 
path-loss function assuming the link is NLOS. While any model can be used for these functions, it is 
common to use the log-distance path-loss model. Shadowing is often not included in the LOS model 
but may be included in the NLOS model. The path-loss equation is then 

P.(d) = I(pios(d)) P!°*(d) + I(pjos(d)) PS (d) (5.307) 


I / 


where p,,.(@) is a Bernoulli random variable which is 1 with probability P,,.(d), and I(-) is an 


indicator function that outputs 1 when the argument is true and 0 otherwise. For small values of d the 
LOS path-loss function dominates, whereas for large values of d the NLOS path-loss function 
dominates. 


Example 5.27 
Consider the LOS/NLOS path-loss model. Assume free-space path loss for the LOS term with G, = G, 
= 0dB, 4 = 0.01m, and a reference distance d) = 1m. Assume a log-distance path loss for the NLOS 


term with B = 4 and a computed from the free-space equation with the same parameters as the LOS 
function. Assume LOS probability in (5.305) with C = 200m. 


Compute the mean signal power at d = 100m. 
Answer: First, compute the expectation of (5.307) as 
E[P,(d)] = E[l(pros(d))] Pr°*(d) + Ell (pios(d))] Pr''*(d) (5.308) 


= Pios(d) P.°*(d) + (1 — Pros(d))P™°*(d). (5.309) 


Evaluating at d= 100 and using the results from Example 5.24 and Example 5.25, 


E[P,(100)] = e~ 19/21 02dB + (1 — e7 100/20) 1 49dB (5.310) 
= 0.61 - 102dB + 0.39 - 142dB (5.311) 
= 117.6dB. (5.312) 


Example 5.28 
Consider the same setup as in Example 5.27. Suppose that the transmit power is P,, = 20dBm. 
Determine the probability that the received power P,.,(100) < —110dBm. 


Answer: The received signal power assuming LOS is 20dBm — 102 = —82dBm using the result of 
Example 5.24. The received power assuming NLOS is 20dBm — 142 = —122dBm using the results of 
Example 5.25. In the LOS case, the received power 1s greater than —110dBm, whereas it is less than 
—110dBm in the NLOS case. Since there is no shadow fading, the probability that the received power 
P.,(100) < —110dBm is P,,,(100) = 0.61. 


5.6.4 Performance Analysis Including Path Loss 


Large-scale fading can be incorporated into various kinds of performance analysis. To see the impact 
of path loss on the algorithms proposed in this chapter, a natural approach is to include the G term in 
(5.275) explicitly when simulating the channel. To illustrate this concept, we consider an AWGN 
channel, deferring more detailed treatment until after small-scale fading has been introduced. 
Consider an AWGN channel where /?|/] = VG6 “J and G= E,/Pjin(@). The SNR in this case is 
SNR 
SNe n= =: (5.313) 
a P. jin (d) . 


Note that SNRp (a) may be a random variable if shadowing or blockage is included in the path-loss 
model. Given a realization of the path loss, the probability of symbol error can then be computed by 


9AM (_SNR 
Fe Py jin (d) 
5) 


substituting SNRp. (a) into the appropriate probability of symbol error equation 
for example, for M-QAM in (4.147). The average probability of symbol error can be computed by 

| | EP. (e385) 
taking the expectation with respect to the random parameters in the path loss as Prain(€) } | Tn 


some cases, the expectation can be computed exactly. In most cases, though, an efficient way to 
estimate the average probability of symbol error is through Monte Carlo simulations. 


Example 5.29 


Compute the average probability of symbol error assuming AWGN and an LOS/NLOS path-loss 
model. 


Answer: Let us denote the path loss in the LOS case as P,, ji, jo,(@) and in the NLOS case as 
Ptinnlos(@), 10 linear units. Based on (5.307), using similar logic to that in Example 5.27, 


SNR . SNR 
E|P,(——_ || = pl*ayp, (=——a) + (1 — Pl*(d))P, Creare 0) 
| (=) (¢) Py. tin,los(@) , (¢)) Py Jin nlos( 


Path loss is important for analyzing performance in cellular systems, or more generally any 
wireless system with frequency reuse. Suppose that a user is located at distance d away from the 
serving base station. Further suppose that there are N,, interfering base stations, each a distance d, 


314) 


on 


away from the user. Assuming an AWGN channel, and treating interference as additional noise, the 
SINR is 


a 
P, r a 


Ey, ee: pa + No 


r: de ) 


SNR 
SNR Ss 5 ils a L 


r,lin (dp ) 


SINR |p. jin (d),{Petin(dn)}n = 


From this expression, it becomes clear that the ratio P,4,(@)/P,in(d,,) determines the relative 
significance of the interference. Ideally P,j,(d)/P,jin(@,) < 1, which means that the user is associated 
with the strongest base station, even including shadowing or LOS/NLOS effects. 


5.7 Small-Scale Fading Selectivity 


In this section, we discuss small-scale fading effects. These effects, which occur on the order of a 
wavelength distance, are caused by the constructive and destructive interference of multipath 
components. These effects manifest in the time domain, leading to time selectivity or, in the frequency 
domain, leading to frequency selectivity. These two forms of selectivity are independent of each 
other, under some assumptions. Before proceeding with the mathematical details, we first develop the 
intuition behind time- and frequency-selective channels. Then we present the foundations for 
determining whether a channel is frequency selective or time selective. We conclude with showing 
the potential system models used in each fading regime. 


5.7.1 Introduction to Selectivity 


In this section, we introduce frequency selectivity and time selectivity. We pursue these in further 
detail in subsequent sections. 


Frequency selectivity refers to the variation of the channel amplitude with respect to frequency. To 
understand how multipath plays an important role in frequency selectivity, consider the following two 
examples. Figure 5.32(a) illustrates the frequency response with a single channel tap. In the frequency 
domain, the Fourier transform of the single impulse function yields a flat channel; that is, the channel 
amplitude does not vary with frequency. We explored the flat channel in Section 5.1. Alternatively, in 
Figure 5.32(b), the channel impulse response has a significant amount of multipath. In the frequency 
domain (taking the Fourier transform of the sum of shifted impulse functions), the resultant channel 
varies considerably with frequency. 


[h.(t)| : 


time frequency 
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(b) 
Figure 5.32 A single path channel (a) and a multipath channel (b). The single path has a frequency- 
flat response whereas the multipath channel has a frequency-selective response. 


The effect of frequency selectivity, though, depends critically on the bandwidth used to 
communicate in that channel. For example, illustrated in Figure 5.33 are the discrete-time complex 
baseband equivalent channels for two different choices of bandwidth, 1MHz and 10MHz. With the 
smaller bandwidth, the impulse response looks nearly like a discrete-time delta function, 
corresponding to a flat channel. With the larger bandwidth, there are more taps (since the symbol 
period is smaller) and there are many more significant taps, leading to substantial intersymbol 
interference. Therefore, the same channel can appear frequency flat or frequency selective depending 
on the signal bandwidth. The range of frequencies over which the channel amplitude remains fairly 
constant is the coherence bandwidth. 
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(a) (b) 
Figure 5.33 (a) Discrete-time complex baseband equivalent channel for a five-path channel with a 
narrow bandwidth; (b) discrete-time complex baseband equivalent channel for a five-path channel 
with a bandwidth ten times larger 


Time selectivity refers to the variation in the channel amplitude as a function of time. The degree or 
extent of time selectivity of a channel is measured using the coherence time of the channel. This 
simply refers to the time duration over which the channel remains fairly constant. The coherence time 
provides guidance on when the received signal model can be assumed to be LTI. Time selectivity is a 
function of the mobility present in the channel and is usually measured using the Doppler spread or 
the maximum Doppler shift. 


For the coherence time to be useful, it must be compared with some property of the signal. For 
example, suppose that the receiver processes data in packets of length N,,,. For the LT assumption to 
be valid, the channel should be roughly constant during the entire packet. Then, if 7NV,,; is less than the 
coherence time, we might say that the LTI assumption is good and the channel is time invariant or 


slow fading. This does not mean that the channel does not vary at all; rather it means that during the 
window of interest (in this case N,,, symbols with period 7) the channel can be considered to be time 


invariant. If TN,., 1s greater than the coherence time, then we might say that the LTT assumption is not 
good and that the channel is time selective or fast fading. 
Note that the bandwidth also plays a role in determining the coherence time of a signal. Assuming a 


sinc pulse-shaping filter, the symbol period 7 = 1/B. As a result, making the bandwidth larger reduces 
the duration of the N,,, symbols. Of course, increasing the bandwidth makes it more likely that the 
channel is frequency selective. This shows how the time and frequency variability of a channel are 
intertwined. 

The time and frequency selectivity are coupled together. For example, if the receiver is moving and 
the multipaths come from different directions, each has a different Doppler shift. As a result, the time 
variability with several multipaths is more severe than with a single multipath. Fortunately, making 


some statistical assumptions about the channel allows the time selectivity to be decoupled from the 
frequency selectivity. Specifically, if the channel is assumed to satisfy the wide-sense stationary 
uncorrelated scattering assumption (WSSUS) [33], then the time-selective and frequency-selective 
parts of the correlation function can be decoupled, and the decision about a channel’s selectivity can 
be made separately based on certain correlation functions, discussed in the next sections. Quantities 
computed from these correlation functions are compared with the bandwidth of the signal of interest, 
the symbol period, and the block length N,,, to determine the effective selectivity. 


To explain this mathematically, suppose that the continuous-time complex baseband equivalent 
channel impulse response is A(t, t) where ¢ is the time and T is the lag. This is a doubly selective 
channel, in that it varies both with time and with lag. The channel is measured assuming a large 
bandwidth, typically larger than the eventual bandwidth of the signal that will be used in this channel. 
For example, measurements might be made in a SOOMHz channel and the results of those 
measurements used to decide how to partition the bandwidth among different signals. We use 
continuous time as this is historically how the WSSUS description was developed [33], but 
interpretations are also available in discrete time [152, 301]. 


Under the WSSUS model, a statistical description of the channel correlations is made through two 
functions: the power delay profile Rgojy(t), and the Doppler spectrum Spoppie(f). The power delay 


profile gives a measure of how the energy of the channel varies in the lag domain for two signals of 
very short duration (so Doppler is neglected). The Doppler spectrum measures essentially how much 
power is present in different Doppler shifts of two narrowband signals (so lag is neglected). 
Frequency selectivity is decided based on Rggy(t), and time selectivity is decided based on 


SDoppler(/)- In the next sections, we explore further the idea of determining the selectivity of a channel 
based on the power delay profile and the Doppler spectrum, or their Fourier transforms. 


5.7.2 Frequency-Selective Fading 
We use the power delay profile Rg.,,(t) to determine whether a channel is frequency-selective 


fading. The power delay profile 1s typically determined from measurements; common power delay 
profiles can be found in different textbooks and in many standards. For example, the GSM standard 
specifies several different profiles, including parameters like typical urban, rural, bad urban, and 
others. Intuitively, in a flat channel (in the bandwidth where the power delay profile was measured) 
Ractay(t) should be close to a delta function. 


The typical way to measure the severity of a power delay profile is based on the root mean square 
(RMS) delay spread. Define the mean excess delay as 


— fo_Ratetay (7) 7dr _ 
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and the second moment as 
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(5.318) 


Then the RMS delay spread is the difference: 


ORMS,delay = \/ T? — (7)?. (5.319) 


With this definition, a channel is said to be frequency flat if the symbol period satisfies T ~s. 
ORMSdelay: Lhis means that the effective spread is much smaller than a symbol, so there will be little 


ISI between adjacent symbols. 


Example 5.30 
Consider the exponential power delay profile Rggpy(t) = e “Y. Determine the RMS delay spread. 


Answer: The mean excess delay is 


Te @) ae la 
- Io re-T/Vdr _ 
7 = SS (5.320) 
for e-*/7dr 
Py 
I , 
= — (5.321) 
=“ (5.322) 
The second moment is 
oC —T/7,-2 
— el ydr 
T? = I ae (5.323) 
{, etd, 
23 
= — (5.324) 
= oy" (5.325) 


Therefore, the RMS delay spread is 


ORMS.,delay = (5.326) 
—+4 (5.327) 
Therefore, the value y is the RMS delay spread. 
The Fourier transform of the power delay profile is known as the spaced-frequency correlation 
function: 
®OO * 
Sdelay (Alag) = | Raclay (Te 7" "87 dr. (5.328) 
0 


It measures the correlation as a function of the difference A, = /,—f, between sinusoids sent on two 


different carrier frequencies. The spaced-frequency correlation function is used to define the 
coherence bandwidth of the channel. One definition of coherence bandwidth is the smallest value of 
Aigg Such that |S gotay(Beon)| = 9-5S detay(0). Essentially this is the first point where the channel becomes 


decorrelated by 0.5. 


It is common to define the coherence bandwidth based on the RMS delay spread. For example, 
1 


5ORMS, delay 


Beoh = (5.329) 
The coherence bandwidth is interpreted like traditional bandwidth and is meant to give a measure 
over which the channel is (statistically speaking) reasonably flat. In particular, a channel is flat if the 
bandwidth B < B,,,. There are several different definitions of coherence bandwidth in the literature; 
all have an inverse relationship with the RMS delay spread [191, 270]. 


Example 5.31 


Determine the spaced-frequency correlation function, the coherence bandwidth from the spaced- 
frequency correlation function, and the coherence bandwidth from the RMS delay spread for the 
exponential power delay profile in Example 5.30. 


Answer: The spaced-frequency correlation function is 


aes 
Sdelay (Atag) = | Réetay(7)e 32" 487 dr (5.330) 
0 
= / e~t/T e—)27 AingT qr (5.331) 
0 
1 
= , D.dd2 
; ie 7 j27Ajag ( , ) 
To find the coherence bandwidth from the spaced-frequency correlation function: 
S A | 5.333 
| delay | lag )| = \ . “a an? A? (5.< de ) 
and 
[Sdelay (0)| = ‘y. (5.334) 
The smallest nonnegative value of Aj,, that determines the coherence bandwidth is 
al 
Beoh = sie Bae (5.335) 
A 2 
Based on the RMS delay spread, 
! a 4 “ye , 
Beoh = —_ (5.336) 
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Since V3/4 = 0.43 and 1/5 = 0.2, these two expressions differ by a factor of 2. The coherence 
bandwidth based on the RMS delay spread 1s more conservative between the two measures in this 
case. 


In practice, the power delay profile or spaced-frequency correlation function is determined from 
measurements. For example, suppose that a training signal is used to generate channel estimate A[n, C] 
at time n. Then the discrete-time power delay profile can be estimated from M observations as 
1>N-lipt, gil2 , 

N Den=o lhln, 4] . The spaced-frequency correlation function S gey(Ajg) could be estimated by 
sending sinusoids at Af= f, — f; and estimating the correlation between their respective channels at 


several different times. Or it could be computed in discrete time using an OFDM system by taking the 
DFT of each channel estimate H[n, k] for a givenn, assuming K total subcarriers, then estimating the 


spaced-frequency correlation as a function of subcarriers as 


Sdelay [Ke — ky] = x Rial H[n, ky|H*[n, ko] 


n=0 


5.7.3 Time-Selective Fading 
The Doppler spectrum Spoppjer(f) is used to determine if a channel is time-selective fading. The 


Doppler spectrum can be estimated via measurements or more commonly is based on certain 
analytical models. Given a general Doppler spectrum, a common approach for determining the 
severity of the Doppler is to define an RMS Doppler spread Ogys doppler 11 the same way the RMS 


delay spread is defined. Then a signal is considered to be time invariant if B >> Opms doppler: In 


mobile channels, the maximum Doppler shift may be used instead of the RMS Doppler spread. The 
maximum Doppler shift is f,, =/cv/c where v is the maximum velocity and c is the speed of light. The 


maximum shift occurs when the transmitter 1s moving either straight to or straight from the receiver 
with velocity v. For many systems, the maximum Doppler shift gives a reasonable approximation of 
the RMS Doppler spread, typically leading to a more conservative definition of time selective. 


The maximum Doppler shift varies as a function of mobility. When there is higher mobility, the 
velocity is higher. From a system design perspective, we often use the maximum velocity to determine 
the coherence time. For example, a fixed wireless system might assume only pedestrian speeds of 
2mph, whereas a mobile cellular system might be designed for high-speed trains that travel at 
hundreds of miles per hour. 


Example 5.32 
Determine the maximum Doppler shift for a cellular system with f, = 1.9GHz that serves high-speed 
trains with a velocity of 300kn/h. 

Answer: The velocity of 300kn/7/h in meters per second is 83.3m/s. The maximum Doppler shift is 
then 

83.3 

2.97 x 10° 
= 533Hz. (5.338) 


fn = 1.9 x 10° (5.337) 


Analytical models can also be used to determine the Doppler spectrum. Perhaps the most common 
is the Clarke-Jakes model. In this model, the transmitter is stationary while the receiver is moving 
with a velocity of v directly toward the transmitter. There is a ring of isotropic scatterers around the 
receiver, meaning that multipaths arrive from all different directions and with different corresponding 
Doppler shifts. The Doppler spectrum under this assumption is 


- x./f2—f2 VE [= Fay Fea | ee 
SDoppler (Ff) a an - (5.339) 
V E [—fm:; fm]. 


Plotting the Clarke-Jakes spectrum in Figure 5.34, we see what is known as the horned Doppler 
spectrum. 
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Figure 5.34 The Clarke-Jakes spectrum plotted for f,, = 533Hz 


The time selectivity of the channel can also be determined from the spaced-time correlation 
function, which is the inverse Fourier transform of the Doppler spectrum: 
"OC 


Rpoppier(Atime) = | Spoppler( fern! df. (5.340) 


—00 


The spaced-time correlation function essentially gives the correlation between narrowband signals 
(so delay spread can be neglected) at two different points in time Ajne =f, — t. The coherence time 


of the channel is defined as the smallest value of Ajin, such that [Rp oppter(Atime)| = 9-SRpoppier(9). The 
resulting value is 7,,,}. It is common to define the coherence time based on either the RMS Doppler 


spread or the maximum Doppler shift, in the same way that the coherence bandwidth is defined. A 
channel is said to be LTT over block N,; tf TMiot — Toh: For the Clarke-Jakes spectrum, it is common 
to take T,,, = 0.423/f,, [270]. 


Example 5.33 


Determine the coherence time using the maximum Doppler shift for the same situation as in Example 
5.32. Also, if a single-carrier system with 1MHz of bandwidth is used, and packets of length N,,; = 


100 are employed, determine if the channel is time selective. 

Answer: The coherence time is 7,4 = 1/(5f,) = 0.375ms. With a bandwidth of 1MHz, Tis at most 
lps, assuming sinc pulse shaping, less if other forms of pulse shaping are used. Comparing N,,,7'= 
100us with T.,,,, we can conclude that the channel will be time invariant during the packet. 


For the Clarke-Jakes spectrum, the spaced-time correlation function can be computed as 
Rpoppler(Atime) = Jo(27 fmAtime) (5.341) 


where Jo(-) is the zero'"-order Bessel function. The spaced-time correlation function is plotted in 


Figure 5.35. The ripples in the temporal correlation function lead to rapid decorrelation but do show 
some longer-term correlations over time. 
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Figure 5.35 The spaced-time correlation function corresponding to the Clarke-Jakes spectrum 
plotted for f,, = 533Hz 


An interesting aspect of time-selective fading is that it depends on the carrier. Normally we use the 
baseband equivalent channel model and forget about the carrier f,. Here is one place where it 1s 


important. Note that the higher the carrier, the smaller the coherence time for a fixed velocity. This 
means that higher-frequency signals suffer more from time variations than lower-frequency signals. 


In practice, it is common to determine the spaced-time correlation function from measurements. For 
example, suppose that short training sequences are used to generate channel /[n, €] at time 1 over N 
measurements. The spaced-time correlation function may then be estimated as 

MN N-1 oo "L* , 
RDoppler[M1 — M2] = HF Mn=o XueWo Mla, Lh" |na, €] The Doppler spectrum could also be 
measured through estimating the power spectrum. 


5.7.4 Signal Models for Channel Selectivity 


The selectivity of a channel determines which signal processing channel model is appropriate. As a 
result of the decomposition of time and frequency selectivity, there are four regions of selectivity. In 
this section, we present typical models for each region and comment on the signal processing 
required at the receiver in each case. 


Time Invariant/Frequency Flat 


In this case, the equivalent system, including the channel, carrier frequency offset, and frame delay 
(supposing symbol synchronization has been performed), can be written as 


y[n] =e" hs[n — d] + v[n] (5.342) 


forn=0,1,..., M— 1. The signal processing steps required at the receiver were dealt with 
extensively in Section 5.1. 


Time Invariant/Frequency Selective 
In this case, the equivalent system, including the channel, carrier frequency offset, and delay, is 


g; 
y[n] = e!?™e" pe hlé)s[n — d — €] + v[n] (5.343) 


&=0 


VL 
forn =0, 1,..., Ni — 1 where the impulse response {hlé) be<o includes the effects of multipath, the 


transmit pulse shape, and receive matched filter, as well as any symbol synchronization errors. The 
signal processing steps required at the receiver were dealt with extensively in Section 5.2 through 
Section 5.4. 


Time Variant/Frequency Flat 


Assuming that the channel changes slowly with respect to the symbol period but changes faster than 
L. Mov 


y[n] = h[n|s{n — d] + v[n] (5.344) 
forn=0,1,..., N,4— 1. Ifthe channel changes too fast relative to 7, then the transmit pulse shape 


will be significantly distorted and a more complex linear time-varying system model will be 
required. We have incorporated the presence of a small carrier frequency offset into the time-varying 


channel; the presence of larger offsets would require a different model. 


One way to change the channel estimation and equalization algorithms in Section 5.1 is to include a 
tracking loop. The idea is to use periodically inserted pilots (at less than the coherence time) to 


exploit the correlation between the samples of /[n]. In this way, an estimate of h\n] canbe generated 
by using concepts of prediction and estimation, for example, a Wiener or Kalman filter [172, 143]. 
Another approach is to avoid channel estimation altogether and resort to differential modulation 
techniques like DQPSK. Generally, these methods have an SNR penalty over coherent modulations 
but have relaxed or no channel estimation requirements [156, 287]. 


Time Variant/Frequency Selective 


Assuming that the channel changes slowly with respect to the symbol period but changes faster than 
Io 


L 


£=0 
forn=0,1,..., M4 — 1. The channel is described by a two-dimensional linear time-varying system 


with impulse response {hin, él}z. -0, which is often called a doubly selective channel. We have again 
incorporated the presence of carrier frequency offset into the time-varying channel. If the channel 
changes a lot, then some additional Nyquist assumptions may need to be made and the discrete-time 
model would become much more complex (typically this is required only for extremely high 
Dopplers). 


The time- and frequency-selective channel is the most challenging from a signal processing 
perspective because the channel taps change over time. Operating under this kind of channel is 
challenging for two reasons. First, estimating the channel coefficients 1s difficult. Second, even with 
the estimated coefficients, the equalizer design is also challenging. One way to approach this problem 
is to use basis expansion methods [213] to represent the doubly selective channel as a function of a 
smaller number of more slowly varying coefficients. This parametric approach can help with channel 
estimation. Then modifications of OFDM modulations can be used that are more suitable for time- 
varying channels [320, 359]. Operating in the time- and frequency-selective region is common for 
underwater communication [92]. It is not yet common for terrestrial systems, though there is now 
growing interest by companies like Cohere Technologies [230]. 


5.8 Small-Scale Channel Models 


Because of the challenges associated with time selectivity, wireless communication systems are 
normally engineered such that the channel is time invariant over a packet, frame, block, or burst 
(different terminology for often the same thing). This is generally called fading. Even in such systems, 
though, the channel may still vary from frame to frame. In this section, we describe stochastic small- 
scale fading models. These models are used to describe how to generate multiple realizations of a 
channel for analysis or simulation. First, we review some models for flat fading, in particular 
Rayleigh, Ricean, and Nakagami fading. Then we review some models for frequency-selective 
fading, including a generalization of the Rayleigh model and the Saleh- Valenzuela clustered channel 
model. We conclude with an explanation about how to compute a bound on the average probability of 
symbol error in a flat-fading channel. 


5.8.1 Flat-Fading Channel Models 


In this section, we present different models for flat-fading channels. We focus on the case where the 
channel /, is a random variable that is drawn independently from frame to frame. We provide some 


explanation about how to deal with correlation over time at the end. 

The most common model for flat fading is the Rayleigh channel model. In this case, /, has 
distribution Vc(9, 1). The variance is selected so that ©||/ vs|7] = I ¢:0 that all gain in the channel is 
added by the large-scale fading channel. This is called the Rayleigh model because the envelope of 
the channel |h,| has the Rayleigh distribution, in this case given by Sn.) = 2x exp(—x’). The magnitude 
squared Af is the sum of the squares of two N(0, 1/2) random variables, giving it a (scaled due to the 
1/2) chi-square distribution. The channel phase (/,) is uniformly distributed on [0, 27]. The Rayleigh 


fading channel model is said to model the case where there is a rich scattering NLOS environment. In 
this case, paths arrive from all different directions with slightly different phase shifts, and based on 
the central limit theorem, the distribution converges to a Gaussian. 


Sometimes there is a dominant LOS path. In these cases, the Ricean channel model is used, 
parameterized by the K-factor. In the Ricean model, h, has distribution Ne(u, 07). The Rice factor is 
K = |p?/o?. Enforcing E[|hs|*] = ! then u? +o” = 1. Substituting for u? = o?K and simplifying leads 
to |H| = Vi /(1+ 4) and o?= 1/(1 + K). Then, in terms of K, h, has distribution 


Ne(e'/K/(1 + 4) where 6 = phase(), but in most cases 8 = 0 is selected. The Rice factor varies 
from K = 0 (corresponding to the Rayleigh case) to K = «© (corresponding to a non-fading channel). 


Other distributions are used as well, inspired by measurement data, to give a better fit with 
observed data. The most common is the Nakagami-m distribution, which is a distribution for |/,| with 


an extra parameter m, with m = | corresponding to the Rayleigh case. The phase of /, is taken to be 
uniform on [0, 27]. The Nakagami-m distribution is 
Imm 2m —1 


py a 5 346 
f(e) = = (5.346) 


When |i,| is Nakagami-m, |, has a gamma distribution '(m, 1/m). The gamma distribution with two 


parameters is also used to give further flexibility for approximating measured data. 


Time selectivity can also be incorporated into flat-fading models. This is most commonly done for 
the Rayleigh fading distribution. Let h,[] denote the small-scale fading distribution as a function of 


n. In this case, n could index symbols or frames, depending on whether the channel is being generated 
in a symbol-by-symbol fashion or a frame-by-frame fashion. For purposes of illustration, we generate 
the channel once per N,,, symbols. 


Suppose that we want to generate data with spatial correlation function Rpoppier(Atime). Let 
RD oppler/A] = Rpoppter(AT Mot) be the equivalent discrete-time correlation function. We can generate a 
Gaussian random process with correlation function RpoppierLA] by generating an IID Gaussian random 


process with Ne(9,1) and filtering that process with a filter g[k] such that Rpoppterlk] = g[A] * q* 
[—-k]. Such a filter can be found using algorithms in statistical signal processing [143]. It is often 
useful to implement the convolution by representing the filters in terms of their poles and zeros. For 
example, the coefficients of an all-pole IIR filter can be found by solving the Yule- Walker equations 
using the Levinson-Durbin recursion. The resulting channel would then be considered an 
autoregressive random process [19]. Other filter approximations are also possible, for example, 
using an autoregressive moving average process [25]. 


There are also deterministic approaches for generating random time-selective channels. The most 
widely known approach is Jakes’s sum-of-sinusoids approach [165], as a way to generate a channel 
with approximately the Clarke-Jakes Doppler spectrum. In this case, a finite number of sinusoids are 
summed together, with the frequency determined by the maximum Doppler shift and a certain 
amplitude profile. This approach was widely used with hardware simulators. A modified version 
with a little more randomness improves the statistical properties of the model [368]. 


5.8.2 Frequency-Selective Channel Models 


In this section, we present different models for frequency-selective fading channels, focusing on the 
case where the channel coefficients are generated independently for each frame. Unlike in the 
previous section, we present two classes of models. Some are defined directly in discrete time. 
Others are physical models generated in continuous time, then converted to discrete time. 


The discrete-time frequency-selective Rayleigh channel is a generalization of Rayleigh fading. In 
this model, the variance of the taps changes according to the specified symbol-spaced power delay 
profile Racyll] = Ractay(€7). In this model /,[€] is distributed as Ne(0, Raetaylé]), 4 special case of 
this model is the uniform power delay profile where /,[¢] is distributed as Nc (0, 1) which is often 
used for analysis because of its simplicity. Sometimes the first tap has a Ricean distribution to model 
an LOS component. 

L r Oe cen, K ; 

Note that the total power in the channel demo lhsléll” = dicio Raelay el which depending on 
the power delay profile may be greater than 1. This 1s realistic because the presence of multiple paths 
allows the receiver to potentially capture more energy versus a flat-fading channel. Sometimes, 
though, the power delay profile may be normalized to have unit energy, which is useful when making 
comparisons with different power delay profiles. 


Another class of frequency-selective channel models forms what are known as clustered channel 
models, the first of which is the Saleh- Valenzuela model [285]. This is a model for a continuous-time 


impulse response based ona set of distributions for amplitudes and delays and is a generalization of 
the single-path and two-path channels used as examples in Section 3.3.3. The complex baseband 
equivalent channel is 


h(t) — >.  axadol = Tn Tm,q) (5.347) 


m=0 q=0 


where T,,, is the cluster delay, o,, , 1s the complex path gain, and T,, , is the path delay. The discrete- 
time equivalent channel is then given using the calculations in Section 3.3.5 as 


git. wi 
hain] =F > S- ainwtltet ~Ta=— ed): (5.348) 


m=0 q=0 


The choice of g(t) depends on exactly where the channel is being simulated. If the simulation only 
Takes 
requires {hs\4]}r0 obtained after matched filtering and sampling, then choosing g(t) = g,,(0) * 2..(0) 


makes sense. If the channel is being simulated with oversampling prior to symbol synchronization and 
matched filtering, then alternatively it may make sense to replace g(t) with a lowpass filter with 
bandwidth corresponding to the bandwidth of x(¢). In this case, the discrete-time equivalent may be 
obtained through oversampling, for example, by 7/M.,. 


The Saleh-Valenzuela model is inspired by physical measurements that show that multipaths tend to 
arrive in clusters. The parameter T,, denotes the delay ofa cluster, and the corresponding t,, , denotes 
the g'" ray from the m" cluster. 


The cluster delays are modeled as a Poisson arrival process with parameter ®. This means that the 
interarrival distances 7, — T,,_; are independent with exponential distribution /(T7,,|7,,,,) = 
Mexp(—®(7,, — T,,;)). Similarly, the rays are also modeled as a Poisson arrival process with 
parameter 6, giving T,, ¢—T,,,e-; an exponential distribution. The parameters ® and $ would be 
determined from measurements. 

The gains 0,,, 4 
delay and the ray delay increase. Specifically, a, , is distributed as 


are complex Gaussian with a variance that decreases exponentially as the cluster 


; 7 = a 7 ; jrm 7 o— tx" — _ . 
Ne(0, exp(—Tm/T) exp(—Tm,q/T) where T and 7 are also parameters in the model. 


The Saleh-Valenzuela model is distinctive in that it facilitates simulations with two scales of 
randomness. The first scale is in the choice of clusters and rays. The second scale is in the variability 
of the amplitudes over time. A simulation may proceed as follows. First, the cluster delays {7,,? and 
the associated ray delays t,, , are generated from the Poisson arrival model. For this given set of 


clusters and rays, several channel realizations may be generated. Conditioned on the cluster delays 
and ray delays, these channel coefficients are generated according to the power delay profile 


OO :— {i ‘i Se > — Tr ,« /@ f, yr ; ‘ ‘ é ‘ a 7 
Ene : /°ult—Lm) where u(t) is the unit step function. In practical applications of 
this model, the number of rays and clusters is truncated, for example, ten clusters and 50 rays per 
cluster. 


The impulse-based channel model in (5.347) can be used in other configurations, with or without 
clusters. For example, given a model of a propagation environment, ray tracing could be used to 


determine propagation paths, gains, and delays between a transmitter and a receiver. This information 
could be used to generate a channel model through (5.347). The model could be updated by moving a 
user through an area. In another variation, the cluster locations are determined from ray tracing, but 
then the rays arriving from each cluster are assumed to arrive randomly with a certain distribution 
(usually specified by the angle spread) [37, 38]. Other variations are possible. 


There are different ways to incorporate mobility into a frequency-selective channel model. In the 
case of the discrete-time frequency-selective Rayleigh channel, the same approaches can be used as 
in the flat-fading case, but applied per tap. For example, each tap could be filtered so that every tap 
has a specified temporal correlation function (usually assumed to be the same). In the case of the 
Saleh- Valenzuela model, the clusters and rays could also be parameterized with an angle in space, 
and a Doppler shift derived from that angle. This could be used to model a time-varying channel 
given a set of clusters and rays. 


5.8.3 Performance Analysis with Fading Channel Models 


The coherence time and the coherence bandwidth, along with the corresponding signal bandwidth and 
sample period, determine the type of equivalent input-output relationship used for system design. 
Given a fading channel operating regime, an appropriate receiver can be designed based on system 
considerations, including the target error rate. The impact of fading on the system performance, 
though, depends on the discrete-time channel and the receiver processing used to deal with fading. 


In complicated system design problems, as is the case in most standards-based systems, 
performance of the system is typically estimated using Monte Carlo simulation techniques. 
Essentially, this involves generating a realization of the fading channel, generating a packet of bits to 
transmit, generating additive noise, and processing the corresponding received signal. 

For some special cases, though, it is possible to use a stochastic description of the channel to 
predict the performance without requiring simulation. This 1s useful in making initial system design 
decisions, followed up by more detailed simulations. In this section, we provide an example of the 
analysis of the probability of symbol error. 


Consider a flat-fading channel described as 
y(n] = VExhss{n] + v[n] (5.349) 


where we assume the large-scale fading is G = E,. When the channel is modeled as a random 
By | 


| | . ‘tenag Fe (Melts), 
variable, the instantaneous probability of symbol error rate writtenas “ \Ne' */is also a random 
variable. One measure of performance in this case is the average probability of symbol error. Other 


measures based on a notion of outage are also possible. The average probability of error is written as 


B,. E,. " , . 
P(e) = Bo. [Pe (Im) one 
Ex) 


P.(& hs). - - . 
where No '’/ is the probability of symbol error conditioned on a given value of /,. The 
expectation is taken with respect to all channels in the distribution to give 


Ex Ex 
P,(—) = | P.( = 
(ie) = LP ae 


Note that we indicate this as a single integral, but since the channel is complex, it is more general to 
have a double integral. Given a prescribed fading channel model and probability of symbol error 
expression, the expectation or a bound can sometimes be calculated in closed form. 


c) fh. (e)de. (5.351) 


In this section, we calculate the expectation of the union bound on the probability of symbol error 
in an AWGN channel. Using the union bound, 


> 
Ex |h|? dinin (5 352) 
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While solutions exist to help calculate the O(-) function, such as Craig’s formula [80], a simpler 
method is to use the Chernoff bound @(") < e~ = . This gives 


E, 1 Ex|hs|? 12 
Pe. ( "7 hs < 5 (M —1)e7~ 48 Amin, (5.353) 


To proceed, we need to specify a distribution for the channel. 
Suppose that the channel has the Rayleigh distribution. A way to evaluate (5.353) is to use the fact 
that h, 1s ¢ Ne(0,1) and that any distribution function integrates to 1. Then, substituting in for Sh, (c) =a 


! exp(—|c?), it follows that 


l Sxle|? 42 | 2 it 3 > Exlel*dz in 1 2 
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Putting it all together: 
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Example 5.34 
Compute the Chernoff upper bound on the union bound for Rayleigh fading with M-QAM. 


2 _ 6 
Answer: For M-QAM, we can insert dinin = M-—1 into (5.358) to obtain 
EB 1 1 
os ( 4 < 5(M il) (5.359) 
ial . AM—DN, + | 
This provides somewhat more intuition than the exact solution, computed in [310] as 
n(B)-2(-J 
\ No M 
1 1 
= = 5.360 


A comparison of both the exact and upper bounds is provided in Figure 5.36 for 4-QAM. Here it is 
apparent that the upper bound is loose but has the correct slope at high SNR. 
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Figure 5.36 Probability of error curve comparing Gaussian and Rayleigh fading for 4-QAM 


The implication of (5.358) is that the probability of error decreases as a function of the inverse of 


the SNR for Rayleigh fading channels. Note, though, that for the non-fading AWGN channel, the 
probability of error decreases exponentially (this can be visualized from the Chernoff upper bound). 
This means that fading channels require a much higher average SNR to achieve a given probability of 
error. The difference between the required SNR for an AWGN channel at a particular error rate and 
the target required for a fading channel is known as the small-scale fading margin. This is the extra 
power required to compensate for fading in the channel. An illustration is provided in Figure 5.36 for 
4-QAM. For example, at a symbol error rate of 10-*, 4dB of SNR are required for an AWGN channel 
but 14dB are required for the Rayleigh channel. This means that a 10dB small-scale fade margin 
would be required in Rayleigh fading compared to an AWGN channel. 


5.9 Summary 
¢ Single-path propagation channels delay and attenuate the received signal. 


¢ Multipath propagation channels create intersymbol interference. The discrete-time complex 
baseband equivalent channel includes the transmit and receive pulse shaping. It can be modeled 


NVL 
with FIR filter coefficients 1/*|4) }e<o, 


¢ Equalization is a means of removing the effects of multipath propagation. Linear equalization 
designs a filter that is approximately the inverse of the channel filter. Equalization can also be 
performed in the frequency domain using OFDM and SC-FDE frameworks. 


¢ Frame synchronization involves identifying the beginning of a transmission frame. In OFDM, 
frame synchronization is called symbol synchronization. Known training sequences or the 
periodic repetition of a training sequence can be used to estimate the beginning of the frame. 


¢ Carrier frequency offset is created by (small) differences in the carrier frequencies used at the 
transmitter and the receiver. This creates a phase rotated on the received signal. Carrier 
frequency offset synchronization involves estimating the offset and derotating the received 
signal to remove it. Special signal designs are used to enable frequency offset estimation prior 
to channel estimation. 


¢ Training sequences are sequences known to both the transmitter and the receiver. They are 
inserted to allow the receiver to estimate unknown parameters like the channel, the frame start, 
or the carrier frequency offset. Training sequences with good correlation properties are useful 
in many algorithms. 

¢ Propagation channel models are used to evaluate the performance of signal processing 
algorithms. Large-scale fading captures the average characteristics of the channel over 
hundreds of wavelengths, whereas small-scale fading captures the channel behavior on the 
order of wavelengths. Channel models exist for the large-scale and the small-scale fading 
components. 


¢ Path loss describes the average loss in signal power as a function of distance. It is normally 
measured in decibels. The log-distance path-loss model describes the loss as a function of the 
path-loss exponent and possibly an additional random variable to capture shadowing. The 
LOS/NLOS path-loss model has a distance-dependent probability function that chooses from an 
LOS or NLOS log-distance path-loss model. 


¢ The small-scale fading characteristics of a channel can be described through frequency 
selectivity and time selectivity. 


¢ Frequency selectivity is determined by looking at the power delay profile, computing the RMS 
delay spread, and seeing if it is significant relative to the symbol period. Alternatively, it can be 
assessed by looking at the coherence bandwidth and comparing the bandwidth of the signal. 


¢ Time selectivity is quantified by looking at the spaced-time correlation function and comparing 
the frame length. Alternatively, it can be determined from the RMS Doppler spread or the 
maximum Doppler shift and comparing the signal bandwidth. 


¢ There are several different flat-fading and frequency-selective channel models. Most models 
are stochastic, treating the channel as a random variable that changes from frame to frame. 
Rayleigh fading is the most common flat-fading channel model. Frequency-selective channels 
can be generated directly based on a description of their taps or based on a physical description 
of the channel. 


Problems 

1. Re-create the results from Figure 5.5 but with 16-QAM. Also determine the smallest value of 
M.,, such that 16-QAM has a 1dB loss at a symbol error rate of 107+. 

2. Re-create the results from Example 5.3 assuming training constructed from Golay sequences as 
[ag; be], followed by 40 randomly chosen 4-QAM symbols. Explain how to modify the 
correlator to exploit properties of the Golay complementary pair. Compare your results. 

3. Consider the system 


y(n] = hs{[n] + v[n] (5.361) 


where s[n] is a zero-mean WSS random process with correlation r,,[], v[m] is a zero-mean 
WSS random process with correlation r,,,[”], and s[n] and v[7] are uncorrelated. Find the 
linear MMSE equalizer g such that the mean squared error 1s minimized: 


E{le[n]|?] = E[|s{n] — g*y[n]|]. (5.362) 


(a) Find an equation for the MMSE estimator g. 


(b) Find an equation for the mean squared error (substitute your estimator in and compute the 
expectation). 


(c) Suppose that you know r,,[m] and you can estimate r,,.[n] from the received data. Show how 
to find r,,,[”] fromr,.[”] and ry [nm]. 

(d) Suppose that you estimate r,,[] through sample averaging of N samples, exploiting the 
ergodicity of the process. Rewrite the equation for g using this functional form. 

(ec) Compare the least squares and the MMSE equalizers. 


4. Consider a frequency-flat system with frequency offset. Suppose that 16-QAM modulation is 
used and that the received signal is 


y[n] = elmer \/ Exs{n] + v[n] (5.363) 


where € = forrset/s and exp(j27¢7) 1s unknown to the receiver. Suppose that the SNR is 10dB and 


the packet size is N = 101 symbols. The effect of ¢ is to rotate the actual constellation. 


(a) Consider y[0] and y[1] in the absence of noise. Illustrate the constellation plot for both 
cases and discuss the impact of ¢ on detection. 


(b) Over the whole packet, where is the worst-case rotation? 


(c) Suppose that ¢ is small enough that the worst-case rotation occurs at symbol 100. What is 
the value of ¢ such that the rotation is greater than 1/2? For the rest of this problem assume 
that ¢ is less than this value. 


(d) Determine the ¢ such that the symbol error rate is 10~°. To proceed, first find an expression 
for the probability of symbol error as a function of ¢«. Make sure that ¢ is included 
somewhere in the expression. Set equal to 10~? and solve. 

5. Let T be a Toeplitz matrix and let T* be its Hermitian conjugate. If T is either square or tall 
and full rank, prove that T*T is an invertible square matrix. 


6. Consider the training structure in Figure 5.37. Consider the problem of estimating the channel 
from the first period of training data. Let s[0], s[1], ..., s[,,-1] denote the training symbols, 


and let s[N,,], s[M, + 1], ...,s[N— 1] denote the unknown QAM data symbols. Suppose that we 


can model the channel as a frequency-selective channel with coefficients h[0], A[1],..., Ale]. 
The received signal (assuming synchronization has been performed) is 


L 
y(n] = D, hlé|s{n — £] + v[n]. (5.364) 


£=0 


Frame 


+ > < > 
Nt N-Nt 
Training Data symbols 
Figure 5.37 A frame structure with a single training sequence of length N, followed by N — N, data 
symbols 


(a) Write the solution for the least squares channel estimate from the training data. You can use 
matrices in your answer. Be sure to label the size of the matrices and their contents very 
carefully. Also list any critical assumptions necessary for your solution. 


(b) Now suppose that the estimated channel is used to estimate an equalizer that is applied to 


y[n], then passed to the detector to generate {s|n]} Ni ; : tentative symbol decisions. We 
would like to improve the detection process by using a decision-directed channel estimate. 
Write the solution for the least squares channel estimate from the training data and tentative 
symbol decisions. 


(c) Draw a block diagram for the receiver that includes synchronization, channel estimation 
from training, equalization, reestimation of the channel, reequalization, and the additional 
detection phase. 


(d) Intuitively, explain how the decision-directed receiver should perform relative to only 


training as a function of SNR (low, high) and W,, (small, large). Is there a benefit to multiple 
iterations? Please justify your answer. 


~ 


f 


7. Let H be the Toeplitz matrix that is used in computing the least squares equalizer *na. Prove 


that if 2!0] # 0. FF is full rank. 


8. Consider a digital communication system where the same transmitted symbol s[n] is repeated 
over two different channels. This is called repetition coding. Let h, and h, be the channel 


coefficients, assumed to be constant during each time instance and estimated perfectly. The 
received signals are corrupted by v,[] and v,[”], which are zero-mean circular symmetric 


complex AWGN of the same variance 07, that is, v,[7], v»[7] ~ NAO, 67). In addition, s[7] has 
zero mean and E|s[n]|? = 1, we assume that s[n], v,[1], and v,[7] are uncorrelated with each 
other. The received signals on the two channels are given by 
yi[n] = hys[n] + vy[n], (5.365) 
y2|[n] = heas|n] + va[n]. (5.366) 


Suppose that we use the equalizers g, and g, for the two time instances, where g, and g, are 


complex numbers such that |g)? + |g)? = 1. The combined signal, denoted as z[n], is formed by 
summing the equalized signals in two time instances: 


2[n| = giyi[n] + g2ye|n). (5.367) 
If we define the following vectors: 
y[n] = . : (5.368) 
y2|n 
h | 
h= ml (5.369) 
ho 
vy(n| - 
v[n| = (5.370) 
v2(n| 
g = e) (5.371) 
g2 
then 
y|n] = hs[n] + v[n] (5.372) 
and 


z[n] = g™ y(n]. (5.373) 


(a) Write an expression for z[”], first in terms of the vectors h and g, then in terms of g), g>, A, 
and /,. Thus you will have two equations for z[7]. 

(b) Compute the mean and the variance of the noise component in the combined signal z[7]. 
Remember that the noise is Gaussian and uncorrelated. 

(c) Compute the SNR of the combined signal z[] as a function of o, /,, ho, g), and g> (or o, h, 


and g). In this case the SNR 1s the variance of the combined received signal divided by the 
variance of the noise term. 

(d) Determine g, and g, as functions of /, and A, (or g as a function of h) to maximize the SNR 
of the combined signal. Hint: Recall the Cauchy-Schwarz inequality for vectors and use the 
conditions when equality holds. 

(e) Determine a set of equations for finding the LMMSE equalizers g, and g, to minimize the 
mean squared error, which is defined as follows: 


E||e[n}|? = E|s{n] — z[n]|?. (5.374) 


Simplify your equation by exploiting the orthogonality principle but do not solve for the 
unknown coefficients yet. Hints: Using the vector format might be useful, and you can assume 
you can interchange expectation and differentiation. First, expand the absolute value, then 
take the derivative with respect to g* and set the result equal to zero. Simplify as much as 
possible to get an expression of g as a function of 1, Ao, and o. 


(f) The autocorrelation matrix R,,[0] of y[7] is defined as 


; ™ ry.y,[0] Ty, y2(0} ve 
Ryy [0] = E(y[nly*[n]) = | ese (5.375) 
“= Tyoy [0] Vy2ye [0] 
Compute R,,[0] as a function of 6, hy, and hy. Hint: "ss [0] = B\s[n]|? = lang 
rab|O] = E(a{n]b"[n)) for random processes a[n] and b[n]. 
(g) Now solve the set of equations you formulated in part (e). 


9. Consider an order L-tap frequency-selective channel. Given a symbol period of 7 and after 
oversampling with factor M... and matched filtering, the received signal is 


- kT kT — 
rik S s[nlh | — —nT) +0 ’ 9.376 
r[k] sinh (5 nt) +0 (FF ] (5.376) 


n=—ox 


where s is the transmitted symbols and v is AWGN. Suppose a fractionally spaced equalizer 
(FSE) is applied to the received signal before downsampling. The FSE /[A] is an FIR filter with 
tap spacing of 7/M,, and length MN. Figure 5.38 gives the block diagram of the system. 


Fractionally 
spaced equalizer: 


fk] 


Figure 5.38 Block diagram of a receiver with FSE 


(a) Give an expression for w[x], the output of the FSE, and yj, [7], the signal after 
downsampling. 


(b) We can express the FSE filter coefficients in the following matrix form: 


f (0) fl am fT 
f(M] fiM+1) 0 f2M—]] | 
= . (5.377) 
FU(N-1)M] f[(N-1)M41] --- f[NM-]] 


We can interpret every (j + 1) column of F as a 7-spaced subequalizer fj[n] where 7 = 0, 2, 


.... M-1. Similarly, we can express the FSE output, the channel coefficients, and the noise as 
T-spaced subsequences: 


M Pan J r orc 

rin] =? hr —T WT | (5.378) 
M —7 

h;[n| = h |nT —T 2 (5.379) 
M-j : 

v;[n] = w ur —T Yj | ; (5.380) 

Using these expressions, show that 
r;[n| = s[n] « hj[n] + v;[n]. (5.381) 


(c) Now, express yyy [7] as a function of s[n], f[”], h;[7], and o,[7]. This expression is known 
as the multichannel model of the FSE since the output is the sum of the symbols convolved 
with the 7-spaced subequalizers and subsequences. 


(d) Draw a block diagram of the multichannel model of the FSE based on part (c). 
(e) Now consider a noise-free case. Given the channel matrix H; given by 


h{0| 0 vee 
h[M — J] hij Le 0 


hiM — j] "tl hij] 
. (5.382) 
A[(L —1)M — J] ve h{M — J] 
0 Al(L —1)M — j] 
h{(L —1)M — j] 
we define 
Bi’= [Fig Hi nes Higa] (5.383) 
f = (fyi Sion «> || (5.384) 


where fj is the ("+ 1) column of F. Perfect channel equalization can occur only if the solution 

to h= Cf, e = e, lies in the column space of H, and the channel H has all linearly independent 

rows. What are the conditions on the length of the equalizer N given M,,, and L to ensure that the 

latter condition is met? Hint: What are the dimensions of the matrices/vectors? 

(f) Given your answer in (e), consider the case of M/ = 1 (1.e., not using an FSE). Can perfect 
equalization be performed in this case? 


10. Consider an order L-tap frequency-selective channel. After matched filtering, the received 
signal is 


y|n| = sa hlé|s{n — £] + v[n, (5.385) 


£=0 


where s[7] is the transmitted symbols, /[€] is the channel coefficient, and v[7] is AWGN. 
Suppose the received symbols corresponding to a known training sequence {1, —1, —-1, 1} are 
{0.75 + j0.75, —0.75 — 0.25, —0.25 — 70.25, 0.25 + 70.75}. Note that this problem requires 
numerical solutions; that is, when you are asked to solve the least squares problems, the results 
must be specific numbers. MATLAB, LabVIEW, or MathScript in LabVIEW might be useful for 
solving the problem. 


YL 
(a) Formulate the least squares estimator of {h 4] }¢0 in the matrix form. Do not solve it yet. 
Be explicit about your matrices and vectors, and list your assumptions (especially the 
assumption about the relationship between L and the length of the training sequence). 


(b) Assuming L = 1, solve for the least squares channel estimate based on part (a). 


(c) Assume we use a linear equalizer to remove the eff ects of the channel. Let { f(4) he =0 be an 
FIR equalizer. Let ng be the equalizer delay. Formulate the least squares estimator of 


Ae 
{flél }e=0 given the channel estimate in part (b). Do not solve it yet. 
(d) Determine the range of values of nq. 


(e) Determine the ng that minimizes the squared error J,ng] and solve for the least squares 
equalizer corresponding to this ny. Also provide the value of the minimum squared error. 
(f) With the same assumptions as in parts (a) through (e) and with the value of ng found in part 
(e), formulate and solve for the direct least squares estimator of the equalizer’s coefficients 
{flé) be =0. Also provide the value of the squared error. 
11. Prove the following two properties of the DFT: 
(a) Let x[n] <> X[k] and x,[n] — X,[A]. If.X{ [A] = e?™*""X[k], we have 


i= rl((n-—m))n] O0O<n<N-1 (5.386) 
aie 0 otherwise. ey 


(b) Let y[n] <> Y[K], h[n] <> H[K] and s[n] <> S[A]. If Y[A] = A[A]S[A], we have 
yn] = Deo h[As{((n — 4))w] 
12. Consider an OFDM system. Suppose that your buddy C.P. uses a cyclic postfix instead of a 
cyclic prefix. Thus 


w[n] = — >. s[k]e)27° (5.387) 
forn=0,1,...,N+L, where L, is the length of the cyclic prefix. The values of w[n] for n < 0 
and n > N+ L, are unknown. Let the received signal be 


L 
y(n] = os hléjwi[n — €| + vin]. (5.388) 


(a) Show that you can still recover {s[h' he =0 with a cyclic postfix instead of a cyclic prefix. 
(b) Draw a block diagram of the system. 
(c) What are the differences between using a cyclic prefix and a cyclic postfix? 
13. Consider an SC-FDE system with equivalent system yn] = Saw h\é|s|n—€)+v[n] The 
length of the cyclic prefix is L,. Prove that the cyclic prefix length should satisfy L, = L. 
14. Consider an OFDM system with N = 256 subcarriers in SMHz of bandwidth, with a carrier of 
Jf. = 2GHz and a length ZL = 16 cyclic prefix. You can assume sinc pulse shaping. 
(a) What is the subcarrier bandwidth? 
(b) What is the length of the guard interval? 


(c) Suppose you want to make the OFDM symbol periodic including the cyclic prefix. The 
length of the period will be 16. Which subcarriers do you need to zero in the OFDM 
symbol? 


(d) What is the range of frequency offsets that you can correct using this approach? 


15. Consider an OFDM system with N subcarriers. 
(a) Derive a bound on the bit error probability for 4-QAM transmission. Your answer should 
depend on h[k], No, and diyin- 
(b) Plot the error rate curves as a function of SNR for N= 1, 2, 4, assuming that h[0] = £,, h[1] 
= E,/2, h[2] =—JE,/2, and h[3] = E,e ?"3/3. 
16. How many pilot symbols are used in the OFDM symbols during normal data transmission (not 
the CEF) in IEEE 802.11a? 


17. IEEE 802.1lad is a WLAN standard operating at 60GHz. It has much wider bandwidth than 
previous WLAN standards in lower-frequency bands. Four PHY formats are defined in IEEE 
802.1lad, and one of them uses OFDM. The system uses a bandwidth of 1880MHz, with 512 
subcarriers and a fixed 25% cyclic prefix. Now compute the following: 


(a) What is the sample period duration assuming sampling at the Nyquist rate? 
(b) What is the subcarrier spacing? 

(c) What is the duration of the guard interval? 

(d) What is the OFDM symbol period duration? 


(e) In the standard among the 512 subcarriers, only 336 are used as data subcarriers. Assuming 
we use code rate 1/2 and QPSK modulation, compute the maximum data rate of the system. 


[p\yL 
18. Consider an OFDM communication system and a discrete-time channel with taps {h{f]} é=0. 
Show mathematically why the cyclic prefix length ZL, must satisfy L > Ly. 


19. In practice, we may want to use multiple antennas to improve the performance of the received 
signal. Suppose that the received signal for each antenna can be modeled as 


Zi\n| = > hy[é|s[n — €] + vi [n] (5.389) 


£=0 


r2[n] = = h2[é\s[n — £] + ve|[n]. (5.390) 


£=0 


Essentially you have two observations of the same signal. Each is convolved by a different 
discrete-time channel. 


(A) (A) rz, 
In this problem we determine the coefficients of a set of equalizers 91 Ik] and 92 “(kl such 
that 


K 


oO tk ala ln — WS ol kl|ha{n — k] = 6[n — Al (5.391) 


k=0 k=0 
where A is a design parameter. 


ef Neral 
(a) Suppose that you send training data {t|n]} n=0 . Formulate a least squares estimator for 


bp) L 
finding the estimated coefficients of the channel {hi [é]} é=0 and {h 2 [E| be =(), 


(b) Formulate the least squares equalizer design problem given your channel estimate. Hint: 
You need a squared error. Do not solve it yet. Be explicit about your matrices and vectors. 


(c) Solve for the least squares equalizer estimate using the formulation in part (b). You can use 
matrices in your answer. List your assumptions about dimensions and so forth. 


(d) Draw a block diagram of a QAM receiver that includes this channel estimator and 
equalizer. 


(ec) Now formulate and solve the direct equalizer estimation problem. List your assumptions 
about dimensions and so forth. 


(f) Draw a block diagram of a QAM receiver that includes this equalizer. 


20. Consider a wireless communication system with a frame structure as illustrated in Figure 5.39. 
Training is interleaved around (potentially different) bursts of data. The same training sequence 
is repeated. This structure was proposed relatively recently and is used in the single-carrier 
mode in several 60GHz wireless communication standards. 


training data 1 training data 2 eee 


Figure 5.39 Frames with training data 


11a! 
Let the training sequence be {t\n}}n“o and the data symbols be {s[n]}. Just to make the 
problem concrete, suppose that 


t|n| n€ [0, Mr — 1] 
sin — Nel né(N.,M+N-] 
wn] = tn-(N_+N)] ne€([M+N,2Mr+N—-1]) >. (5.392) 
sin-(2Nr+N)| ne2NrtNnMN+N-I 
etc. 


Suppose that the channel is linear and time invariant. After matched filtering, synchronization, 
and sampling, the received signal is given by the usual relationship 


L 
y(n] = > h[{é}w[n — €] + v[n]. (5.393) 
£=0 
Assume that V is much greater than N,,, and that N,, = L. 
‘ 2M r + N _ 1 . . 

(a) Consider the sequence {w[n]} =o . Show that there exists a cyclic prefix of length 

Ne 
(b) Derive a single-carrier frequency-domain equalization structure that exploits the cyclic 

5 rece ace 
prefix we have created. Essentially, show how we can recover {s[n] } n=0 from {y[n]}. 


(c) Draw a block diagram for the transmitter. You need to be as explicit as possible in 
indicating how the training sequence gets incorporated. 


(d) Draw a block diagram for the receiver. Be careful. 
(ec) Suppose that we want to estimate the channel from the training sequence using the training 


on either side of the data. Derive the least squares channel estimator and determine the 
minimum value of N,, required for this estimator to satisfy the conditions required by least 
squares. 


(f) Can you use this same trick with OFDM, using training for the cyclic prefix? Explain why or 
why not. 


21. Consider an OFDM communication system. Suppose that the system uses a bandwidth of 
40MHz, 128 subcarriers, and a length 32 cyclic prefix. Suppose the carrier frequency is 
5.785GHz. 


(a) What is the sample period duration assuming sampling at the Nyquist rate? 
(b) What is the OFDM symbol period duration? 

(c) What is the subcarrier spacing? 

(d) What is the duration of the guard interval? 


(e) How much frequency offset can be corrected, in hertz, using the Schmidl-Cox method with 
all odd subcarriers zeroed, assuming only fine offset correction? Ignore the integer offset. 
(f) Oscillators are specified in terms of their offset in parts per million. Determine how many 
parts per million of variation are tolerable given the frequency offset in part (e). 
22. Consider an OFDM communication system with cyclic prefix of length L,. The samples 
conveyed to the pulse-shaping filter are 


1 fy fe Min—he) : Fak one Sane 
: S° s|mle'*" : n=0,....N+2,.-1. (5.394) 


w|n| = 


Recall that 
w([n] = w[n+ N], forn=0,...,L.—-—1 (5.395) 


is the cyclic prefix. Suppose that the cyclic prefix is designed such that the channel order L 
satisfies L, = 2L. In this problem, we use the cyclic prefix to perform carrier frequency offset 


estimation. Let the receive signal after match filtering, frame offset correction, and 
downsampling be written as 


L 
y(n} = garam >. hléjw[n — €) + v[n], for n = 0,..., N+1,-1. (5.396) 


£=() 


(a) Consider first a single OFDM symbol. Use the redundancy in the cyclic prefix to derive a 
carrier frequency offset estimator. Hint: Exploit the redundancy in the cyclic prefix but 
remember that L, = 2L. You need to use the fact that the cyclic prefix is longer than the 


channel. 
(b) What is the correction range of your estimator? 


(c) Incorporate multiple OFDM symbols into your estimator. Does it work if the channel 
changes for different OFDM symbols? 


23. Synchronization Using Repeated Training Sequences and Sign Flipping Consider the 


framing structure illustrated in Figure 5.40. This system uses a repetition of four training 
sequences. The goal of this problem is to explore the impact of multiple repeated training 
sequences on frame synchronization, frequency offset synchronization, and channel estimation. 


si || sti 


<~—., ~=~+ > «+> +> 


Ne Nt Nt Ne N-4N_ data symbols ee 
Training 
Figure 5.40 A communication frame with four repeated training sequences followed by data 
symbols 


(a) Suppose that you apply the frame synchronization, frequency offset estimation, and channel 
estimation algorithms using only two length N,. training sequences. In other words, ignore the 
two additional repetitions of the training signal. Please comment on how frame 
synchronization, frequency offset estimation, and channel estimation work on the 
aforementioned packet structure. 


(b) Now focus on frequency offset estimation. Treat the framing structure as a repetition of two 
length 2N,, training signals. Present a correlation-based frequency offset estimator that uses 
two length 2N,. training signals. 

(c) Treat the framing structure as a repetition of four length N,, training signals. Propose a 
frequency offset estimator that uses correlations of length N,, and exploits all four training 
signals. 

(d) What range of frequency offsets can be corrected in part (b) versus part (c)? Which 1s better 
in terms of accuracy versus range? Overall, which approach is better in terms of frame 
synchronization, frequency offset synchronization, and channel estimation? Please justify 
your answer. 

(e) Suppose that we flip the sign of the third training sequence. Thus the training pattern 


becomes 7) T -T; T instead of T 7 T T. Propose a frequency offset estimator that uses 
correlations of length N,, and exploits all four training signals. 


(f) What range of frequency offsets can be corrected in this case? From a frame synchronization 
perspective, what is the advantage of this algorithm versus the previoius algorithm you 
derived? 


24. Synchronization in OFDM Systems Suppose that we would like to implement the frequency 
offset estimation algorithm. Suppose that our OFDM systems operate with N = 128 subcarriers 
in 2MHz of bandwidth, with a carrier of f, = 1GHz and a length L = 16 cyclic prefix. 


(a) What is the subcarrier bandwidth? 


(b) We would like to design a training symbol that has the desirable periodic correlation 
properties that are useful in the discussed algorithm. What period should you choose and 
why? Be sure to consider the effect of the cyclic prefix. 

(c) For the period you suggest in part (b), which subcarriers do you need to zero in the OFDM 
symbol? 

(d) What is the range of frequency offsets that you can correct using this approach without 


requiring the modulo correction? 
25. Consider the IEEE 802.11a standard. By thinking about the system design, provide plausible 
explanations for the following: 
(a) Determine the amount of carrier frequency offset correction that can be obtained from the 
short training sequence. 
(b) Determine the amount of carrier frequency offset correction that can be obtained from the 
long training sequence. 
(c) Why do you suppose the long training sequence has a double-length guard interval followed 
by two repetitions? 
(d) Why do you suppose that the training comes at the beginning instead of at the middle of the 
transmission as in the mobile cellular system GSM? 
(ec) Suppose that you use 10MHz instead of 20MHz in IEEE 802.11a. Would you rather change 
the DFT size or change the number of zero subcarriers? 
26. Answer the following questions for problem 17: 
(a) How much frequency offset (in Hertz) can be corrected using the Schmidl-Cox method with 
all odd subcarriers zeroed, assuming only fine offset correction? Ignore the integer offset. 
(b) Oscillator frequency offsets are specified in parts per million (ppm). Determine how many 
parts per million of variation are tolerable given the result in part (a). 


27. GMSK is an example of continuous-phase modulation (CPM). Its continuous-time baseband 
transmitted signal can be written as 


g(t) = el F Daa — a all d(t-n1) (5.397) 


ro|4 


where a[n] is a sequence of BPSK symbols and $(f) is the CPM pulse. For GSM, T= 
6/1.625e6 ~ 3.69us. 


The BPSK symbol sequence a[n] is generated from a differentially binary (0 or 1) encoded 
data sequence d[n] where 


aln] = 1 — 2(d[n] @ d[n — 1}) (5.398) 
where © denotes modulo-2 addition. Let us denote the BPSK encoded data sequence as 
b[n] = 1 — 2(d[n)). (5.399) 
Then (5.398) can be rewritten in a simpler form as 
a(n] = b[n|b[n — 1]. (5.400) 


Consider the Gaussian pulse response 


g(t) = By = en er. (5.401) 


Denote the rectangle function of duration 7 in the usual way as 


t 1 lth< 5 
‘ect [ — } = 5.402 
= @ 0 elsewhere. ( 


For GSM, BT = 0.3. This means that B = 81.25kHz. The B here is not the bandwidth of the 
signal; rather it is the 3dB bandwidth of the pulse g(Z). 


The combined filter response is 


l a 
h(t) = g(t) * rect(t/T) = ii / g(t — T)dr. (5.403) 
J—-T/2 
Then the CPM pulse is given by 
t 
p(t) =| h(r)dr. (5.404) 
ea 
Because of the choice of BT product, it is recognized that 
0, t<0Oh 
b(t) = % 5.405 
o(t) 1 t>47. (5.405) 


This means that the current GMSK symbol a[n] depends most strongly on the three previous 
symbols a[n — 1], a[n — 2], and a[n — 3]. 

Note that the phase output o(¢) depends on all previous bits d[n] because of the infinite 
integral. Consequently, the GSM modulator is initialized by the state give in Figure 5.41. The 
state is reinitialized for every transmitted burst. 


dummy bits | 3 tail bits 3 tail bits dummy bits 
suena 111111 baat renee TUT TT Cees 


Output phase 


The useful part 


1/2 bit 1/2 bit 


The active part 


Figure 5.41 GMSK is a modulation with memory. This figure based on [99] shows how each burst 
is assumed to be initialized. 


The classic references for linearization are [171, 173], based on the pioneering work on 
linearizing CPM signals in [187]. Here we summarize the linearization approach of [171, 173], 


which was used in [87] for the purpose of blind channel equalization of GMSK signals. We 
sketch the idea of the derivation from [87] here. 


(a) Exploiting the observation in (5.405), argue that 
s(t) = eb? Un=-w alnJo(t—nT) (5.406) 


n 
wy fF Dat alk Il oi Falk] o(t-kT) (5.407) 
k=n-3 


for t © [nT (n+1)7). This makes the dependence of s(t) on the current symbol and past three 
symbols clearer. 


(b) Explain how to obtain 


el Falk] O(t-kT) — cog (Salkjott — kT)) +jsin (Falrlote kT)) . (5.408) 


(c) Using the fact that a[m] are BPSK modulated with values +1 or —1, and the even property of 
cosine and the odd property of sine, show that 


eiFalklO(t-kT) _ oo (=o(t 7 kT)) + ja{k] sin & ti KT)). (5.409) 


(d) Now let 
=k £40 
sen (t) = ; ¢=0 ; (5.410) 
L C20 


Note that for any real number ¢ = sgn(‘) |¢|. Now define 


B(t) = cos (Ssen(¢)9(¢)) (5.411) 
It can be shown exploiting the symmetry of (7) that 

cos (=0(t)) = B(t) (5.412) 

sin (S9(¢)) = p(t — 47). (5.413) 


Substitute in for B to obtain 
s(t) el F Dkai wall [] (@@-k7) +Jjaln]a(t-— kT -47)). (5.414) 
k=n-3 


(e) Show that 


o@ 

a Da ag|n|co(t — nT’) (5.415) 

where 

12 Lik=-x~ UA] — ja[njag[n —1] = —alnja[n — 1Jao[n — 2] 
(5.416) 


ag|n] := e 


and 
co(t) = B(t —T)B(t — 2T) A(t — 3T) B(t — 4T) (5.417) 


for t © [0, 57]. This is called the one-term approximation. 
(f) Modify the typical transmit block diagram to implement the transmit waveform in (5.415). 


(g) Modify the typical receiver block diagram to implement the receiver processing in a 
frequency-selective channel. 


28. Plot the path loss for a distance from 1m to 200m for the following channel models and given 
parameters. The path loss should be plotted in decibels. 


(a) Free space assuming G, = G, = 3dB, and A = 0.1m 
(b) Mean log distance using a Imreference with free-space parameters as in part (a) with path- 
loss exponent B = 2 


(c) Mean log distance using a 1m reference with free-space parameters as in part (a) with path- 
loss exponent B = 3 


(d) Mean log distance using a Imreference with free-space parameters as in part (a) with path- 
loss exponent B = 4 


29. Plot the path loss at a distance of 1km for wavelengths from 10m to 0.01m for the following 
channel models and given parameters. The path loss should be plotted in decibels and the 
distance plotted on a log scale. 


(a) Free space assuming G, = G, = 3dB 
(b) Mean log distance using a Imreference with free-space parameters as in part (a) with path- 
loss exponent B = 2 


(c) Mean log distance using a 1m reference with free-space parameters as in part (a) with path- 
loss exponent B = 3 
(d) Mean log distance using a Imreference with free-space parameters as in part (a) with path- 
loss exponent B = 4 
30. Consider the path loss for the log-distance model with shadowing for 6,,,4 = 8dB, n = 4, a 
reference distance of 1m with reference loss computed from free space, assuming G, = G, = 


3dB, for a distance from 1m to 200m. Plot the mean path loss. For every value of d you plot, 
also plot the path loss for ten realizations assuming shadow fading. Explain what you observe. 
To resolve this issue, some path-loss models have a distance-dependent 6,),,4. 


31. Consider the LOS/NLOS path-loss model with P,,,(d) = e 4/200 free space for the LOS path 


loss, log distance without shadowing for the NLOS with B = 4, reference distance of 1m, G, = 
G, = 0dB, and A = 0.1m. Plot the path loss in decibels for distances from | to 400m. 

(a) Plot the LOS path loss. 

(b) Plot the NLOS path loss. 

(c) Plot the mean path loss. 


(d) For each value of distance in your plot, generate ten realizations of the path loss and 
overlay them on your plot. 


(ec) Explain what happens as distances become larger. 

32. Consider the path loss for the log-distance model assuming that B = 3, with reference distance 
Im, G, = G, = 0dB and A = 0.1m, and o,,,4 = 6dB. The transmit power is P,, = 10dBm. 
Generate your plots from 1 to 500m. 

(a) Plot the received power based on the mean path-loss equation. 


(b) Plot the probability that P.,(d) <—110dBm. 
(c) Determine the maximum value of d such that P,,(d) > —110dBm 90% of the time. 


33. Consider the LOS/NLOS path-loss model with P,,.(d) = e 4/200 free space for the LOS path 
loss, log distance without shadowing for the NLOS with B = 4, reference distance of 1m, G, = 
G, = 0dB, and A = 0.1m. The transmit power is P,, = 10dBm. Generate your plots from | to 
500m. 

(a) Plot the received power based on the mean path-loss equation. 
(b) Plot the probability that P.,(d) < —110dBm. 


(c) Determine the maximum value of d such that P,.(d) > —110dBm 90% of the time. 


34. Consider the same setup as in the previous problem but now with shadowing on the NLOS 
component with o,,,4 = 6dB. This problem requires some extra work to include the shadowing 
in the NLOS part. 


(a) Plot a realization of the received path loss here. Do not forget about the shadowing. 
(b) Plot the probability that P.,(d) <—110dBm. 


(c) Determine the maximum value of d such that P,.(d) > —110dBm 90% of the time. 


35. Consider a simple cellular system. Seven base stations are located in a hexagonal cluster, with 
one in the center and six surrounding. Specifically, there is a base station at the origin and six 
base stations located 400m away at 0°, 60°, 120°, and so on. Consider the path loss for the log- 
distance model assuming that B = 4, with reference distance 1m, G,= G, = OdB, and A = 0.1m. 
You can use 290K and a bandwidth of B = 1OMHz to compute the thermal noise power as ATB. 
The transmit power is P,, = 40dBm. 


(a) Plot the SINR for a user moving along the 0° line froma distance of 1m from the base 
station to 300m. Explain what happens in this curve. 


(b) Plot the mean SINR for a user froma distance of 1m from the base station to 300m. Use 
Monte Carlo simulations. For every distance, generate 100 user locations randomly on a 
circle of radius d and average the results. How does this compare with the previous curve? 


36. Link Budget Calculations In this problem you will compute the acceptable transmission range 
by going through a link budget calculation. You will need the fact that noise variance 1s ATB 
where k is Boltzmann’s constant (i.e., k = 1.381 x 10°W/Hz/K), T is the effective temperature 
in kelvins (you can use 7’ = 300K), and B is the signal bandwidth. Suppose that field 
measurements were made inside a building and that subsequent processing revealed that the 
data fit the log-normal model. The path-loss exponent was found to be n = 4. Suppose that the 
transmit power is 5}0mW, 10mW is measured at a reference distance dy = 1m from the 


transmitter, and o = 8dB for the log-normal path-loss model. The desired signal bandwidth is 
1MHz, which is found to be sufficiently less than the coherence bandwidth; thus the channel is 
well modeled by the Rayleigh fading channels. Use 4-QAM modulation. 


(a) Calculate the noise power in decibels referenced to 1 milliwatt (dBm). 

(b) Determine the SNR required for the Gaussian channel to have a symbol error rate of 10°. 
Using this value and the noise power, determine the minimum received signal power 
required for at least 10 * operation. Let this be P,,,;, (in decibels as usual). 


(c) Determine the largest distance such that the received power is above Pn. 


(d) Determine the small-scale link margin Lpayieigh for the flat Rayleigh fading channel. The 
fade margin is the difference between the SNR required for the Rayleigh case and the SNR 
required for the Gaussian case. Let Praytigh = LRayleigh + Pmin: 


(e) Determine the large-scale link margin for the log-normal channel with a 90% outage. In 
other words, find the received power Piyrge-scale Fequired such that Piyroe-scale 1S greater than 
Prayleigh 90% of the time. 


(f) Using Piarge-scales determine the largest acceptable distance in meters that the system can 
support. 


(g) Ignoring the effect of small-scale fading, what is the largest acceptable distance in meters 
that the system can support? 
(h) Describe the potential impact of diversity on the range of the system. 


37. Suppose that field measurements were made inside a building and that subsequent processing 
revealed that the data fit the path-loss model with shadowing. The path-loss exponent was 
found to be n = 3.5. If lmW was measured at dy = 1m from the transmitter and at a distance of 
dy = 10m, 10% of the measurements were stronger than —25dBm, find the standard deviation 
Oshaq for the log-normal model. 


38. Through real measurement, the following three received power measurements were obtained at 
distances of 100m, 400m, and 1.6km from a transmitter: 


Distance from Transmitter | Received Power 
100m OdBm 


400m —20dBm 
1.6km —50dBm 


(a) Suppose that the path loss follows the log-distance model, without shadowing, with 


reference distance dy) = 100m. Use least squares to estimate the path-loss exponent. 
(b) Suppose that the noise power is —90dBm and the target SNR is 10dB. What is the minimum 
received signal power required, also known as the receiver sensitivity? 


(c) Using the value calculated in part (b), what is the approximate acceptable largest distance 
in kilometers that the system can support? 

39. Classify the following as slow or fast and frequency selective or frequency flat. Justify your 
answers. In some cases you will have to argue why your answer is correct by making 
reasonable assumptions based on the application. Assume that the system occupies the full 
bandwidth listed. 


(a) A cellular system with carrier frequency of 2GHz, bandwidth of 1.25MHz, that provides 
service to high-speed trains. The RMS delay spread is 2us. 


(b) A vehicle-to-vehicle communication system with carrier frequency of 8300MHz and 
bandwidth of 100kHz. The RMS delay spread is 20ns. 


(c) ASG communication system with carrier frequency of 3.7GHz and bandwidth of 200MHz. 


(d) A 60GHz wireless personal area network with a bandwidth of 2GHz and an RMS delay 
spread of 40ns. The main application is high-speed multimedia delivery. 


(ec) Police-band radio. Vehicles move at upwards of 100mph and communicate with a base 
station. The bandwidth is 50kHz at 900MHz carrier. 


40. Consider a wireless communication system with a frequency carrier of f, = 1.9GHz and a 


bandwidth of B = SMHz. The power delay profile shows the presence of three strong paths and 
is plotted in Figure 5.42. 


Power [dB] 


2 


Excess delay *{°<s| 
Figure 5.42 The power delay profile of a three-path channel 


(a) Compute the RMS delay spread. 


(b) Consider a single-carrier wireless system. Classify the system as frequency flat or 
frequency selective and slow or fast. Assume the system provides service to vehicle-to- 
vehicle communication at the speed of v = 120kn/h and not packet transmission but symbol 
transmission. You should use the stricter relationship between coherence time T,,,}, and 


maximum Doppler frequency f,, by 


(c) Consider an OFDM system with IFFT/FFT size of N = 1024. What is the maximum-length 
cyclic prefix needed to allow for efficient equalization in the frequency domain with 
OFDM? 


(d) For the specified OFDM system, which also provides services to vehicle-to-vehicle 
communication at urban speeds of 30mph, is the channel slow or fast fading? Justify your 
answer. 


41. Suppose that Rgcpy(t) = | for tT = [0, 20ps] and is zero otherwise. 
(a) Compute the RMS delay spread. 
(b) Compute the spaced-frequency correlation function S gejay(Ajg)- 


(c) Determine if the channel is frequency selective or frequency flat for a signal with bandwidth 


B= 1MuHz. 
(d) Find the largest value of bandwidth such that the channel can be assumed to be frequency 
flat. 
42. Suppose that delay(T) = €~ eRe) ge ir — Bus) p. € [0, 
20us]. 


(a) Compute the RMS delay spread. 
(b) Compute the spaced-frequency correlation function S gejay(Ayg)- 


(c) Determine if the channel is frequency selective or frequency flat for a signal with bandwidth 
B= 1MuHz. 

(d) Find the largest value of bandwidth such that the channel can be assumed to be frequency 
flat. 


43. Sketch the design of an OFDM system with 80MHz channels, supporting pedestrian speeds of 
3km/h, an RMS delay spread of 81s, operating at f. = 800MHz. Explain how you would design 


the preamble to permit synchronization and channel estimation and how you would pick the 
parameters such as the number of subcarriers and the cyclic prefix. Justify your choice of 
parameters and explain how you deal with mobility, delay spread, and how much frequency 
offset you can tolerate. You do not need to do any simulation for this problem; rather make 
calculations as required and then justify your answers. 


44. Sketch the design of an SC-FDE system with 2GHz channels, supporting pedestrian speeds of 
3km/h, and an RMS delay spread of 50ns, operating at f, = 64GHz. Explain how you would 
design the preamble to permit synchronization and channel estimation and how you would pick 
the parameters. Justify your choice of parameters and explain how you deal with mobility, delay 
spread, and how much frequency offset you can tolerate. You do not need to do any simulation 
for this problem; rather make calculations as required and then justify your answers. 

45. Consider an OFDM system with a bandwidth of W= 10MHz. The power delay profile 
estimates the RMS delay spread to be Opus delay = SHS. Possible IFFT/FFT sizes for the system 


are N= {128, 256, 512, 1024, 2048}. Possible cyclic prefix sizes (in terms of fractions of NV) 
are {1/4, 1/8, 1/16, 1/32}. 


(a) What combination of FFT and cyclic prefix sizes provides the minimum amount of 
overhead? 


(b) Calculate the fraction of loss due to this overhead. 
(c) What might be some issues associated with minimizing the amount of overhead in this 
system? 
46. Compute the maximum Doppler shift for the following sets of parameters: 
(a) 40MHz of bandwidth, carrier of 2.4GHz, and supporting 3km/h speeds 
(b) 2GHz of bandwidth, carrier of 64GHz, and supporting 3km/h speeds 
(c) 20MHz of bandwidth, carrier of 2.1GHz, and supporting 300km/h speeds 
47. In fixed wireless systems, the Clarke-Jakes spectrum ~ not be appropriate. A better model is 


SDo »ple (f) a CP eT ay | 
the Bell Doppler spectrum where _— 1+Kp(7) has been found to fit to data 


[363] where Kg 1s a constant so that | Spoppter(Aof = 1. In this case, the maximum Doppler shift 
is determined from the Doppler due to moving objects in the environment. 
(a) Determine Ky for a maximum velocity of 160kn/h and a carrier frequency of 1.9GHz. 
(b) Determine the RMS Doppler spread. 
(c) Is a signal with bandwidth of B = 20MHz and blocks of length 1000 symbols well modeled 
as time invariant? 

48. Look up the basic parameters of the GSM cellular system. Consider a dual-band phone that 

supports f, = 900MHz and f, = 1.8GHz. What is the maximum velocity that can be supported on 


each carrier frequency so that the channel is sufficiently constant during a burst period? Explain 
your work and list your assumptions. 


49. Look up the basic parameters of an IEEE 802.11a system. What is the maximum amount of 
delay spread that can be tolerated by an IEEE 802.11a system? Suppose that you are in a square 
room and that the access point is located in the middle of the room. Consider single reflections. 
How big does the room need to be to give this maximum amount of channel delay? 

50. Consider 4-QAM transmission. Plot the following for error rates of 1 to 10> and SNRs of 0dB 
to 30dB. This means that for higher SNRs do not plot values of the error rate below 10°. 
¢ Gaussian channel, exact expression 
¢ Gaussian channel, exact expression, neglecting the quadratic term 
¢ Gaussian channel, exact expression, neglecting the quadratic term and using a Chernoff 

upper bound on the Q-function 
¢ Rayleigh fading channel, Chernoff upper bound on the Q-function 

51. Computer In this problem, you will create a flat-fading channel simulator function. The input 
to your function 1s x(7/M,,), the sampled pulse-amplitude modulated sequence sampled at 
T/M,, where M,, is the oversampling factor. The output of your function is 7(n7/M,,), which is 
the sampled receive pulse-amplitude modulated signal (5.7). The parameters of your function 
should be the channel / (which does not include VEX ), the delay ty, /,,, M,, and the 


TX? 
frequency offset ¢. Essentially, you need to determine the discrete-time equivalent channel, 


convolve it with x(n7/M,,), resample, incorporate frequency offset, add AWGN, and resample 

to produce 7(nT/M..,). Be sure that the noise is added correctly so it is correlated. Design some 

tests and demonstrate that your code works correctly. 

52. Computer In this problem, you will create a symbol synchronization function. The input is 
y(nT/M,), which is the matched filtered receive pulse-amplitude modulated signal, M,., the 
receiver oversampling factor, and the block length for averaging. You should implement both 
versions of the MOE algorithm. Demonstrate that your code works. Simulate x(t) obtained from 
4-QAM constellations with 100 symbols and raised cosine pulse shaping with a = 0.25. Pass 
your sampled signal with M,, > 2 through your channel simulator to produce the sampled output 
with M_., samples per symbol. Using a Monte Carlo simulation, with an offset of ty = 7/3 and h 
= 0.323, perform 100 trials of estimating the symbol timing with each symbol synchronization 
algorithm for SNRs from OdB to 10dB. Which one gives the best performance? Plot the 
received signal after correction; that is, plot yinM,x +k* | for the entire frame with real on the 
x-axis and imaginary on the y-axis for 0dB and 10dB SNR. This should give a rotated set of 4- 
QAM constellation symbols. Explain your results. 

53. Computer In this problem, you will implement the transmitter and receiver for a flat-fading 
channel. The communication system has a training sequence of length 32 that is composed of 
length 8 Golay sequences and its complementary pairs as [ag, bg, ag, bg]. The training sequence 
is followed by 100 M/-QAM symbols. No symbols are sent before or after this packet. Suppose 
that M,, = 8, Ty= 47 +7/3, and h = e”3, There is no frequency offset. Generate the matched 
filtered receive signal. Pass it through your symbol timing synchronizer and correct for the 
timing error. Then perform the following: 

(a) Develop a frame synchronization algorithm using the given training sequence. Demonstrate 
the performance of your algorithm by performing 100 Monte Carlo simulations for a range of 
SNR values from 0dB to 10dB. Count the number of frame synchronization errors as a 
function of SNR. 

(b) After correction for frame synchronization error, estimate the channel. Plot the estimation 
error averaged over all the Monte Carlo trials as a function of SNR. 

(c) Compute the probability of symbol error. Perform enough Monte Carlo trials to estimate the 
symbol error reliably to 10. 

54. Computer Consider the same setup as in the previous problem but now with a frequency offset 
of ¢ = 0.03. Devise and implement a frequency offset correction algorithm. 

(a) Assuming no delay in the channel, demonstrate the performance of your algorithm by 
performing 100 Monte Carlo simulations for a range of SNR values from 0dB to 10dB. Plot 
the average frequency offset estimation error. 

(b) After correction for frequency synchronization error, estimate the channel. Plot the 
estimation error averaged over all the Monte Carlo trials as a function of SNR. 

(c) Perform enough Monte Carlo trials to estimate the symbol error reliably to 10. 


(d) Repeat each of the preceding tasks by now including frame synchronization and symbol 
timing with ty as developed in the previous problem. Explain your results. 


55. Computer In this problem, you will create a multipath channel simulator function where the 
complex baseband equivalent channel has the form 


K 
h(t) = Tsinc(t/T) * 5— a46(t — Tr) (5.418) 
| 


and the channel has multiple taps, each consisting of complex amplitude o, and delay t,. The 
input to your function is x(n7/M,,,), the sampled pulse-amplitude modulated sequence sampled 
at T/M,, where M,, is the oversampling factor. The output of your function is r(n7/M,,), which 
is sampled from 
rs 
r(t) = e27set oF apu(t — Te) + v(t). (5.419) 
k=1 


Your simulator should include the channel parameters M,,, 


M,,, and the frequency offset ¢. 
Check that your code works correctly by comparing it with the flat-fading channel simulator. 


Design some other test cases and demonstrate that your code works. 


Chapter 6. MIMO Communication 


Thus far the book has focused on wireless communication systems that use a single antenna for 
transmission or reception. Many wireless systems, however, make use of multiple antennas. These 
antennas may be used to obtain diversity against fading in the channel, to support higher data rates, or 
to provide resilience to interference. This chapter considers communication in systems with multiple 
transmit and/or receive antennas. Broadly this is known as multiple-input multiple-output (MIMO) 
communication. The chapter provides background on different techniques for using multiple transmit 
and receive antennas, including beamforming, spatial multiplexing, and space-time coding. It also 
explains how algorithms developed in prior chapters including channel estimation and equalization 
extend when multiple antennas are present. 


6.1 Introduction to Multi-antenna Communication 


Multiple antennas have been used in wireless communication systems since the very beginning, even 
in Marconi’s early transatlantic experiments at the beginning of the 1900s. Wireless communication 
systems have been making use of simple applications of multiple antennas for several decades. In 
cellular systems they have been used for sectoring (to focus the energy of the base station on a 
particular geographic sector), downtilt (to focus the energy away from the horizon), and receive 
diversity (to improve signal reception from low-power mobile stations). In wireless local area 
networks or Wi-Fi, two antennas have often been used for diversity in both the client (e.g., laptop) 
and the access point. These antennas would often be configured to be spatially far apart, for example, 
on either side of the laptop screen. These early applications did not require redesigning the 
communication signal to support multiple antennas. 


Several commercial wireless systems have integrated multiple antennas into their design and 
deployment. Techniques like transmit beamforming and precoding are supported by cellular systems 
(3GPP since release 7) and wireless local area networks (IEEE 802.11n, IEEE 802.1lac, and IEEE 
802.1lad). Systems that exploit multiple antennas for both transmit and receive are said to employ 
MIMO communication [256]. 


The use of MIMO comes from signal processing and control theory. It is used to refer to a system 
that has multiple inputs and multiple outputs. In the case of a wireless system, a MIMO system is 
created from the propagation channel that results from having multiple inputs (from different transmit 
antennas) and multiple outputs (from different receive antennas). There are several other related 
terms, illustrated in Figure 6.1, that are used to refer to different configurations of antennas. Single- 
input single-output (SISO) refers to a communication link with one transmit and one receive antenna. 
This is the usual communication link considered thus far in this book. The other configurations are 
summarized in this section using examples from flat fading and are generalized in subsequent 
sections. 


Figure 6.1 The terminology resulting from different antenna configurations: single-input single- 
output (SISO), single-input multiple-output (SIMO), multiple-input single-output (MISO), and 
multiple-input multiple-output (MIMO) 


The emphasis in this chapter is explicitly on the use of multiple antennas to create a MIMO system. 
In every direction of propagation using a transverse wave, though, there can be two orthogonal 
polarizations. Usually these are known as horizontal and vertical, circular left and circular right, or 
+45°. Every antenna has two patterns, one for its primary polarization and one for its orthogonal 
polarization. Some physical antenna structures, like patch antennas, can be used to send two different 
polarized signals by exciting different modes. In other cases, different antennas are used to send 
different polarizations. Channel models are more complicated with polarization. For example, the 
path loss when using vertical polarization is more adversely affected by rain than with horizontal 
polarizations. Also, the small-scale fading model becomes more complicated because of cross-pole 
decoupling (the leakage of one polarization into another) [39]. Polarization is important in cellular 
systems where the random orientation of the cellular phone creates different polarized signals. We 
defer explicit treatment of polarization in this chapter, but note that polarization is an effective 
complement to MIMO communication. 


6.1.1 Single-Input Multiple-Output (SIMO) 


SIMO refers to a communication link with multiple receive antennas and one transmit antenna. This is 
often called receive diversity and is perhaps the oldest use of multiple antennas [53, 261]. The basic 
idea is to add extra receive antennas and somehow combine the different outputs. The processing can 
be performed in analog or in digital. For example, an analog switch may be used to switch between 
two receive antennas. Alternatively, multiple RF chains and multiple continuous-to-discrete 
converters can be used so that all the antenna outputs can be processed together in digital. The 
potential benefits of SIMO operation include array gain (resistance to noise) and diversity gain 
(resistance to fading). 


Diversity is an important concept in wireless communications. In essence, diversity means having 
multiple avenues to get symbols from the transmitter to the receiver. If the system is well designed, 
each avenue sees a different propagation channel, which increases the likelihood that at least one of 
those channels is good. Diversity can be obtained in many ways in wireless systems. For example, 
with the use of coding and interleaving, coded data can experience channels separated by the 
coherence time, in what is called time diversity. Alternatively, a signal with a large bandwidth can be 
used to take advantage of the multiple paths between the transmitter and the receiver, in what is called 
frequency-selective diversity. Diversity can also be achieved using multiple antennas, in what is 
called spatial diversity. 


As an example, consider a flat-fading SIMO system with digital receiver processing as illustrated 
in Figure 6.2. After matched filtering, symbol timing, frequency offset correction, and frame 
synchronization, the output corresponding to each receive antenna can be written as 


] 


yi(n| = hys{n] + v1 [n] (6.1) 


yo|n| = has|n| + ve[n) (6.2) 


bits 


downconversion 


Figure 6.2 Block diag for a QAM system exploiting SIMO with receive processing in digital 
and multi-antenna linear equalization. The same transmitter structure 1s employed as ina SISO 
scenario. The receiver, though, includes two RF chains. Some processing steps may be performed 
jointly to leverage common information received on both receive antennas. This example shows 
symbol synchronization being done jointly. The channel estimation may be done independently for 
each receiver, but the equalization operation should combine both received signals prior to 
detection. Frame synchronization and frequency offset synchronization are not shown but can also 
leverage multiple receive antennas. 


or in matrix form as 
y(n] = hs{n] + vin]. (6.3) 


The scalars /, and h, are the channels from the transmit antenna to receive antennas | and 2, 
respectively. If the antennas are spaced far enough apart—greater than the coherence distance—then 
the channels are said to be uncorrelated. This provides the receiver with diversity since it has 
multiple opportunities to extract the same information sent through different fading channels and 
augmented by a different additive noise term. Good receiver signal processing algorithms take 
advantage of both observations in the equalization and detection process. 


Selection diversity is one example of a way to process signals from multiple receive antennas 
[261]. The channel with the largest magnitude is selected (given by k) and then Yk is processed. This 
system obtains diversity from fading because if h, and h, are independent (which follows if the 
channels are jointly Gaussian and uncorrelated), then the probability that |h,| and |/,| are both small is 
much smaller than the probability that either one is small. 

There are many other kinds of receiver processing techniques, including maximum ratio combining 
(MRC) [52], equal gain combining (which used to be known as the linear adder [10]), and joint 
space-time equalization [90, 351, 241]. An illustration of the impact of receive diversity is provided 
in Figure 6.3. 


channel 1 
channel 2 
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time 
Figure 6.3 The impact of diversity in a two-antenna SIMO system. One realization for each 
channel is shown. The effective channels created using antenna selection diversity and maximum 
ratio combining are also shown. Receive antenna diversity reduces the occurrences of deep fades. 


Virtually every technique studied thus far in this book can be extended to the case of multiple 
receive antennas without dramatic changes to the resulting signal processing algorithms. Consider 
how functions like symbol synchronization, frame synchronization, and frequency offset correction 
might incorporate observations from multiple antennas. SIMO is discussed in more detail in Section 
6.2. 


6.1.2 Multiple-Input Single-Output (MISO) 


MISO refers to a communication link with multiple transmit antennas and one receive antenna. This is 
often called transmit diversity or transmit beamforming. Multiple transmit antennas have been in wide 
use only recently in commercial wireless systems as they often require additional signal processing at 
the transmitter to extract the most benefit from the antennas. 


A MISO system uses multiple transmit antennas to convey a signal to one receive antenna. 
Generally, communication in the MISO case is much more challenging than in the SIMO case because 
the transmit signals are combined by the channel. Obtaining some advantage from spatial diversity is 
much more difficult. From a signal processing perspective, either some knowledge about the channel 
is required at the transmitter or the information needs to be spread in a special way across the 
transmit antennas to obtain a diversity advantage. 


One approach for using multiple transmit antennas that does not require channel state information at 
the transmitter is known as transmit delay diversity [357]. A variation of delay diversity (called 
cyclic delay diversity [83]) 1s used in IEEE 802.11n. The basic idea, as illustrated in Figure 6.4, is to 
send a successively delayed version of the transmitted signal on each transmit antenna. The effect is 


to create a frequency-selective channel from a flat-fading channel. For example, if the delay is exactly 
one symbol period and there are two transmit antennas (after matched filtering, symbol timing, 
frequency offset correction, and frame synchronization), then 


y[(n] = hysy|[n] + hase|n| + vi[n| (6.4) 


= h,s[n] + hos[n — 1] + v[n] (6.5) 
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Figure 6.4 Block diagram for a QAM system exploiting MISO with delay diversity at the 
transmitter. The transmitter structure is modified over the SISO case by the inclusion of a space- 
time coding block (the dashed box), which in this case implements the delay operation. The 
receiver structure is similar to the SISO case, except that estimation of the channels from each 
antenna is required in general and the equalization may be more complex. With delay diversity, the 
receiver can use processing similar to that for frequency-selective channels. Frame 
synchronization and frequency offset synchronization are not shown but can also leverage multiple 
receive antennas. 


where y[n] experiences intersymbol interference! At first 1t seems like it is bad that now an equalizer 
is required. But note that s[n] contributes to both y[n] and y[n + 1]; thus there are two places to 
extract meaningful information about s[n]. This is known as frequency-selective diversity. 
Unfortunately, linear time-domain equalizers do not reap much of an advantage from this sort of 
diversity. To obtain the most benefits, SC-FDE is required, or coding and interleaving with OFDM, 
or a nonlinear receiver like the maximum likelihood sequence detector. 


There are a number of different approaches for using multiple transmit antennas. Approaches that 
do not use channel state information at the transmitter are known as space-time codes [326, 8, 327, 
125]. Delay diversity is an example of a space-time code. Essentially, with space-time coding, 
information is spread across all of the transmit antennas. More sophisticated receiver algorithms are 
then used to detect the transmitted signal. The main advantage of space-time coding is that it can 
obtain a diversity advantage without requiring information about channel. The main disadvantage is 
that (except in special cases) achieving the best performance requires higher-complexity algorithms at 
the receiver. 


There are several ways to use channel state information at the transmitter, the most common being 
for transmit beamforming. The idea 1s to change the amplitude and phase of the signal sent from each 
antenna so that the signals combine favorably at the receiver. This results in better performance than 
is achieved when the channel is unknown but requires additional system design considerations to 
obtain an estimate of the channel at the transmitter [207]. 


An illustration of the impact of different diversity combining techniques is shown in Figure 6.5. 
Approaches that use channel state information at the transmitter outperform approaches that do not 
exploit knowledge of the channel. Learning the channel at the transmitter requires additional support 


in the communication system, for example, the use of feedback. MIS, including space-time coding and 
beamforming, is discussed in more detail in Section 6.3. 
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Figure 6.5 The impact of diversity in a two-transmit-antenna MISO system. One realization for 

each channel is shown. The effective channels created by sending the same signal across both 

antennas (spatial repetition) is also shown, along with maximum ratio transmission (which uses 
channel state information at the transmitter) and the Alamouti space-time block code. Sending the 
same information across both transmit antennas results in a channel that is no better than either of 
the original channels. Maximum ratio transmission obtains better performance compared with the 

Alamouti code but requires knowledge of the channel at the transmitter. 


6.1.3 Multiple-Input Multiple-Output (MIMO) 


MIMO wireless communication refers to the general case of a communication link that has multiple 
transmit and multiple receive antennas. SISO, SIMO, and MISO are also considered special cases of 
MIMO communication. The principle of MIMO communication is the joint design of transceiver 
techniques that make simultaneous use of multiple communication channels. 


MIMO communication is an established technology in wireless communication systems. It has been 
widely deployed in wireless local area networks through IEEE 802.11n and 1s a key feature of 


various third- and fourth-generation cellular standards and next-generation wireless local area 
network protocols. At this point, MIMO is incorporated into millions of laptops, phones, and tablets. 


Spatial multiplexing, also called V-BLAST [115], is one of the most widely used transmission 
techniques for MIMO systems [255]. The idea of spatial multiplexing is to send different data from 
each transmit antenna, using the same spectrum and with the same total power as ina SISO system. 
The transmitted data streams are mixed together by the MIMO channel, creating cross-talk or self- 
interference. Because each receiver sees a different combination of signals from all the transmitters, 
joint processing can be used to separate out the signals. Essentially, the cross-talk between different 
transmitted signals can be canceled. This allows multiple interference-free channels to coexist, using 
the same spectral resources, thus increasing the channel capacity [130, 328]. 


The idea of spatial multiplexing is illustrated in Figure 6.6 for the case of two transmit and two 
receive antennas. In this example, two symbols are sent at the same time and the transmit power is 
split between them. This allows the symbol rate to be doubled without increasing the bandwidth. 
Assuming a flat-fading channel (after matched filtering, symbol timing, frequency offset correction, 
and frame synchronization), the received signals can be written as 


yi[r] = hy,181[n] + hy 282[n] + vy [n] (6.6) 
yon] = ho1s81[n] + he,282[n] + ve[n] (6.7) 
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Figure 6.6 Block diagram for a QAM system exploiting MIMO spatial multiplexing with two 
transmitter and two receiver antennas. The spatial multiplexing operation is illustrated in the 
dashed box on the transmitter. This is essentially a 1:2 serial-to-parallel converter (also correctly 
called a demultiplexer). The demultiplexing operation takes one symbol stream s[n]and produces 
two symbol streams s,[n] = s[2n] and s,[n] = s[2n + 1]. The power splitting is shown explicitly. 
The receiver involves the usual synchronization operations (frame and frequency offset 
synchronization not shown) followed by joint channel estimation, joint equalization, and 
independent symbol detection. Sometimes the detection and joint equalization are combined. After 
the detection is the spatial demultiplexing operation, which is essentially a 2:1 parallel-to-serial 
converter (also correctly called a multiplexer). 


or in matrix form 


y|n] = Hs[n] + vin}. (6.8) 


The coefficients h, ¢ denote the channel between the ¢*> transmit antenna and the k*" receive antenna. 
Assuming that the channel coefficients are different, each receiver observes a different linear 
combination of the symbols s,[”] and s,[7] and the channel matrix will be invertible. A simple way to 
remove the effects of the channel is to invert the matrix in (6.8) and apply this inverse to the received 
signal to obtain 


S=H'y[n] (6.9) 


followed by independent detection. 


MIMO systems with spatial multiplexing are much more complex to implement than SISO systems 
because of the self-interference created by the transmit antennas. This complicates channel estimation, 
synchronization, and equalization among other system functions. MIMO systems are presented in more 
detail in Section 6.4, followed by treatment of MIMO-OFDM because of its wide use in Section 6.5. 


6.2 Receiver Diversity for Flat-Fading SIMO Systems 


Multiple receive antennas provide a means of receiving multiple copies of the same signal. Each 
copy sees a different noise realization and a different channel realization. In this section, we focus on 
two algorithms for combining the outputs of all the antennas ina SIMO system and provide some 
analysis of those algorithms. We begin with a brief review of SIMO flat-fading channel models, 
explaining how Rayleigh fading generalizes to the SIMO case. Then we describe two algorithms for 
processing the antenna outputs ina SIMO system: antenna selection and maximum ratio combining. 
We provide some analysis of these algorithms for Rayleigh fading. We defer treatment of frequency- 
selective channel estimation and synchronization to the MIMO case (which specializes to SIMO). 


6.2.1 SIMO Flat-Fading Channel Models 


The received signal in a SIMO flat-fading channel model with block fading is given in (6.3), but with 
all the vector dimensions extended to NV, x 1. The channel for the SIMO flat-fading case is 


represented by the N. x 1 vector h. As in the SISO case, it is common to decompose multi-antenna 


channels into their large-scale and small-scale components. Assuming that the antennas are colocated 
(in an area separated by a few wavelengths), then each antenna sees the same large-scale component 
and only the small-scale coefficient differs. Then the channel may be decomposed as 


h = VGh, (6.10) 


where the large-scale gain is G= E,/P,4(d), Prjin(@) is the distance-dependent path-loss term in 
linear (to distinguish it from the more common decibel measure), and the small-scale fading 
coefficients are collected in the vector hy. 

There are many multi-antenna stochastic channel models for h,. Two conceptual models are 
illustrated in Figure 6.7. The most common model used in MIMO systems is the Rayleigh model with 
spatial correlation where h, has vector complex Gaussian distribution Ne(0, Rex) where the main 
parameter is the receive spatial correlation matrix given by Rx = E{hsh2) ty most small-scale 
models, it is common for the average energy of the channel to be normalized to 1; in the vector case 
this means that [R,,];, ; = 1. The off-diagonal terms in the spatial correlation matrix [R,,];, ¢ give the 


spatial correlation between the channel seen by antenna & and the channel seen by antenna ¢. This 
function is defined similarly to the spaced-time correlation function or the spaced-frequency 
correlation function. The spatial case of R,, = Lis known as IID Rayleigh fading or sometimes just 
Rayleigh fading. In this case, the channels experienced by each receive antenna are completely 
uncorrelated. In cases where R,, is non-identity, the channel coefficients may be correlated. The 
exponentially correlated model is often used for analysis where [R,], ¢ = p*—*: 5 is the complex 
correlation coefficient between two adjacent antennas. 
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Figure 6.7 Two different channel-modeling scenarios. In (a), there is rich scattering, which means 
that multipaths arrive from many different directions. Alternatively, in (b), there is clustered 
scattering. Multipaths are clustered together in potentially different-size clusters. 
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Most of the analysis in this chapter is based on the IID Rayleigh fading channel. This is normally 
justified by the rich scattering assumption where there are many multipaths in the environment and the 
antennas are spaced far enough apart. To be uncorrelated, the antennas should be spaced by the 
coherence distance, which is a selectivity quantity that can be related to the coherence time. For 
WLAN and cellular applications, the coherence distance is anywhere from A/2 on devices surrounded 
by clutter to 10 for antennas that are high up on a tower, where A is the wavelength. Further 
decorrelation can be achieved by making the patterns of the antennas different [111, 112, 263]. 


The clustered channel model [313], a spatial generalization of the Saleh- Valenzuela channel model 
[285], is also used in multi-antenna systems for channel modeling. Suppose that there are C clusters in 
the propagation environment, each with R rays. Let the vector a(@) denote the array response vector, 
which is 

a() = [1,e7s2528i0) eH Me-1) 254 sin(9)) T (6.11) 


for the case of a uniform linear array with isotropic elements. The term A/y is the spacing between 
two adjacent antenna elements in terms of wavelength. The array is oriented on the z-axis, and the 
angle @ is called the azimuth direction. Let 0, ,. denote the angle of arrival of the 7 ray from the cl 
cluster. Let a, ,. denote the complex gain of that ray, often modeled as Rayleigh. The clustered channel 
is given as the sum of the rays 

C-1R-1 


1 
h, = ——= ntl a 6.12) 
1, ree a(@.y) (6.12) 


c=0 r=0 


The factor VC of is used to satisfy the power constraints. This clustered model formulation 
assumes that all the clusters are equidistant. Further modifications account for differences in path loss 


for each cluster. 


The clustered channel model is is often generated stochastically by making assumptions about the 
distribution of clusters and angles. Suppose that each cluster is associated with a mean angle of 
arrival 0... Itis common for the mean angle of arrival to be uniformly distributed. Suppose that the 


rays are generated around each cluster with a certain power distribution, called the power azimuth 
spectrum P,,(0). Let a,, be the angle spread associated with the distribution. The most common 


choice for MIMO channel modeling is the Laplacian distribution, which is 
oS aD = 20 
ee e | 7% 


fy 
V 20 as 


Fax\6) =  € [-7,7) (6.13) 
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where 1-exp(~V2r/eas) ig a normalization factor needed to make the function integrate to 1. 
The power azimuth spectrum for each cluster is then shifted by the cluster’s mean angle of arrival as 


Pi A(O ~ O.)an): 
Using these distributions, it is possible to generate (6.12) via simulation. It is also possible to 
derive an equivalent R,, for the clustered channel model [298, 349, 178], though the expressions have 


high computational complexity. Lower-complexity solutions follow using a small angle 
approximation [113]. In this sense, the clustered channel model provides a way to generate the spatial 
correlation matrix for a correlated Rayleigh channel, based on parameters of the propagation 
environment. In general, having fewer clusters and/or a small angle spread leads to significant 
correlation (R,, 1s far away from an identity matrix). Alternatively, having many clusters and/or a 
large angle spread tends to reduce correlation and converges eventually to the IID Rayleigh channel. 


A special case of the clustered model is the single-path channel where 


h, = aa(@). (6.14) 


In this case, the elements of h, are fully correlated since a is common for all antennas. This kind of 


channel model has been widely used in statistical array processing and in smart antenna 
communication systems. It is most practical in cases where there is only one significant propagation 
path, making it more suitable for LOS links for higher-frequency communication systems like 
millimeter wave [145, 268]. 


6.2.2 Antenna Selection 


One simple technique for receive diversity ina SIMO system is called selection combining, wherein 
the receiver measures the SNR on each antenna and chooses to decode the signal from the antenna 
with the highest SNR [261]. An illustration of a receiver implementing selection combining is shown 


in Figure 6.8. Consider the received signal on the i" antenna 
yi[n] = hys[n] + v,[n] (6.15) 


Selecting best 


Envelope of channel receive antenna 
Antenna 
selection 


Figure 6.8 Block diagram of a SIMO system implementing antenna selection 
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fori=1,...,N,. Let i* be the solution to arg max,-; 9 y. n°. Since the SNR of the i" branch given 
h; is \h,°/N,, this selection results in choosing the m that gives the highest SNR. 
The signal corresponding to 7*, 


yar [nr] = hy s[n] + vj«[n], (6.16) 


is selected for symbol detection. The post-selection SNR is the effective SNR after the selection, 
given knowledge of the channel: 
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(6.17) 


A good post-selection SNR is achieved if the channel from at least one antenna is good. 


The performance achieved by selection diversity depends on the assumptions about the fading 
distribution. In Example 6.1, Example 6.2, and Example 6.3, the Chernoff bound on the probability of 
symbol error is calculated for uncorrelated Rayleigh fading. 


Example 6.1 
Consider a SIMO system with NV, receive antennas, and suppose that the channel coefficients are IID 


Nc(0,G), Compute the CDF and PDF of / 07. 
Answer: The CDF of |h;*? is the CDF of max; |h,” given by P (max; |hy|? < 2) 


4 <2) (6.18) 


P (max |hi|? < x) = P(|hy|? < x) P (|ho|? < 2)---P (hy, 
z 


since the h; are independent. 

In general, the sum of K N (0, 1) squared random variables has a chi-square distribution with K 
degrees of freedom. Because each h; is Ne(0, G) ln? is a scaled chi-square distribution with 2 
degrees of freedom (because h; is complex). In this case, the distribution is especially simple and is 


Pil)? <2) ele, (6.19) 
Therefore, 
' ia\ Ne 
P (max h,|* < x) —— (1 —e 7 “) (6.20) 
The PDF is computed from 
1 1 iG IGN.N : 
ab (max |h;|? < r) =e CP eye hy (6.21) 
A 1 a 


Example 6.2 
For the same setting as in Example 6.1, assuming G = E,/P,j,,(d), determine the average post- 


selection SNR given by E{max; |hj|*]No, 


Answer: 
1 ? l 20 ~ ff1I. NI - 
— IE, [max | |?| =F i ie FE fy — greene bay dip (6.22) 
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Note that increasing NV, provides a diminishing gain to the average SNR, a result known as SNR 
hardening. 


Example 6.3 


For the same setting as in Example 6.1, compute the Chernoff bound on the union bound on the 
average probability of symbol error, neglecting path loss, that is, G = E,. 


Answer: 
P, <(M-1)E C (,/ syn max iu) (6.25) 
P “= le fer abot max; al) (6.26) 
7 os 1 [ en Tie Fant g-2/G(y e~t/G)N-1N de (6.27) 
- J0 
_M-1 NV (4¢-@2 in + UT (Nr) (6.28) 
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where is the gamma function. Performing a series expansion around G/N, 
and substituting for G = F,/P,,;;,(d), neglecting higher-order terms, it can be concluded that 
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(6.29) 
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N, 
Ry 
The factor of ( No is common in many calculations for Rayleigh fading channels involving the 


probability of symbol error. Recall that for the SISO case from Section 5.8.3, the probability of 
cs 


symbol error was proportional to €. ) for high SNR. The additional exponent of N, in the case of 


selection diversity is called the diversity gain. A comparison of the exact probability of symbol error 
is provided in Figure 6.9 for different values of NV. for QPSK modulations. Exact expressions for the 


probability of error are available in the literature; see, for example, [310] and the references therein. 
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Figure 6.9 Performance of selection combining in Rayleigh fading channels for QPSK modulation 
and different values of N, 


There are other variations and extensions of antenna selection. One variation is known as scanning 
diversity [142, 53]. The idea is to scan the received signal successively on different antennas, 
stopping on the first antenna that is above a predetermined threshold. That antenna is used until the 
signal drops below a threshold, when the scanning process begins again. In this way the receiver does 
not have to measure the signals from all the receive antennas, allowing the use of an analog antenna 
switch in hardware, at the expense of worse performance. 
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Figure 6.10 Block diagram of a SIMO system implementing maximum ratio combining 
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Antenna selection can be extended to frequency-selective channels [21, 22, 23, 242]. Ina single- 
carrier system, the receiver would need a better metric for selecting the best antenna, for example, 
based on the mean squared error of the equalizer. Inan OFDM system, antenna selection could be 
performed independently as described on each separate subcarrier. It might make more sense, 
however, to perform the switching in analog prior to the OFDM demodulator to reduce the hardware 
requirements. This would require a different metric to choose the antenna that is good for many 
subcarriers [189, 289]. 


A main disadvantage of antenna selection is that it throws away the signals on the receive antennas 


that were not selected for processing. This prompts more sophisticated algorithms that combine the 
received signals from several antennas. 


6.2.3 Maximum Ratio Combining 


Maximum ratio combining (MRC) is a receiver signal processing technique for SIMO systems that 
combines the signals from the receiver antenna to maximize the post-combining SNR [52]. The 
objective of MRC is to determine the coefficients of a vector w, called a beamforming vector, such 
that the combined signal 


r[n] = w*y[n] (6.30) 
= w’hs|n] + w*v([n] (6.31) 
has the largest possible post-processing SNR. 
The post-processing SNR is the effective SNR after beamforming, conditioned on knowledge of h: 


a lw*h/|? | 
i Nollw|l? 
Expanding the norm and absolute values, 
: w*h)(w*h)* 
SNRin = ( a) ) , (6.33) 
Now*w 
The objective is to find w to maximize the post-processing SNR: 
*} *h)* 
WwW = arg max Sil al! .; (6.34) 


weCNr Now*w 


To solve this problem, recognize that w*h is just the inner product between two vectors. From the 
Cauchy-Schwarz inequality, it follows that 

lw*h|” < ||w||*|/h|| (6.35) 
where equality holds if w = ch and c is an arbitrary nonzero complex value. Consequently, 


(w*h)(w*h)* _ (c*h*h)(ch*h)’ 


: 6.36: 
Now*w ~  N,|cl*h*h eed) 
3 h*h (6.37) 
— N, Die fl 
= —|[hI|? (6.38) 
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This reveals that the choice of c is immaterial, so it can be set to c = 1 without loss of generality. This 
also provides a convenient expression for the post-combining SNR that can be used for subsequent 
performance analysis. The intuitive interpretation is that w is a spatial matched filter, matched to the 


vector h. 
The resulting post-combining SNR with the matched filter is then 
\|h 


| 2 


SNR, na (6.39) 
7||h,||? 
_ on | (6.40) 


Since (ni? > max; |h;|", MRC outperforms antenna selection. 


Example 6.4, Example 6.5, and Example 6.6 consider the average SNR and the probability of 
symbol error after optimum combining, assuming an IID Rayleigh fading channel. 


Example 6.4 
Consider a SIMO system with NV, receive antennas, and suppose that the channel coefficients h,; are 

on " 2, MRC 
1D Ve(0, G), Compute the CDF and PDF of NRin 

AS Y Ny } 2 
Answer: Since h; are Nc(0, G) it follows that Piaos [i is the sum of 2N, IID N (0, G/2) 
2 

random variables. This is a scaled (because of the 1/2) chi-square distribution denoted 2, with 2N, 


degrees of freedom. As a result, |{h P/N, is also a scaled chi-square distribution, or equivalently a 
gamma distribution with T (NV,, G/N,). One form of the CDF is given by 


‘ Ny- r m 
|a||? - _ 1 _No __ No Sere 
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whereas the PDF is 


d ||h||? i No Ms ie Ri: poe — 
a? ( vy, <*)=mile) °°? (6.42) 


Example 6.5 
For the same setting as in Example 6.4, determine the average post-combining SNR. 
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In this case the SNR increases linearly with N,, unlike the case with antenna selection. The increase 
in effective SNR here is also known as the maximum array gain. 


Example 6.6 
For the same setting as in Example 6.4, compute the Chernoff bound on the probability of symbol 


error assuming G = E,. 
Answer: 
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The performance analysis in Example 6.6 shows that MRC also achieves a diversity gain of N,. A 
comparison of the exact probability of symbol error is provided in Figure 6.11 for different values of 
N, for QPSK modulations. Exact expressions for the probability of error are available in the literature 


(see, for example, [310] and the references therein). Comparing Figure 6.9 and Figure 6.11, it can be 
seen that at high SNR, MRC provides about a 2dB gain over antenna selection. This is the extra 
benefit achieved from optimum combining. 
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Figure 6.11 Performance of MRC in Rayleigh fading channels for QPSK modulation and different 
values of WN, 


It is possible to view antenna selection through the lens of MRC. Essentially, antenna selection 
chooses combining weights from the set 


0 0 
0 1 0 . 
Le (6.50) 
0 fe) | 


which is smaller than CX. Thus it is expected that MRC should perform better than antenna selection 
because the antenna selection is included in the possible set of beamforming vectors used by MRC. 


MRC can be extended to frequency-selective channels. In a single-carrier system, more complex 
beamforming would be employed. For example, the vector beamformer might be replaced witha 
combined equalizer and combiner [90, 351, 241]. Essentially each receive antenna would be filtered 
and the results combined. The weights for such an equalizing solution can be derived using the least 
squares approach in Chapter 5; variations are described for MIMO systems in Section 6.4.7. MRC 
can be applied in a straightforward fashion in an OFDM system by performing the combining 
individually on each subcarrier. 


There are other kinds of beamforming besides MRC. For example, the beamforming weights can be 
chosen to minimize the mean squared error at the receiver. This would allow for the beamforming to 
deal with spatially correlated noise, created, for example, if there is also interference. Such 
beamforming techniques provide better performance in the presence of correlated noise or 


interference. 

A beamforming technique related to MRC is equal gain combining (EGC) [10, 53, 165]. With 
EGC, the beamforming vector is constrained to have the form w! = [e1, ei, . |, lm]. Unlike 
MRC, with EGC only phase shifts of the received signal are allowed. Let 0; = phase(h,). Solving a 
similar optimization to the one 1n (6.34) over the space of constrained beamforming vectors gives the 
solution w/ = eel, oe elnr] ! The arbitrary phase 0 is usually selected to be either 0 or —0, 
(phase synchronizing to the first antenna branch). EGC matches the phases of the channels so that they 


Ex (Sv |)? 
add coherently at the receiver. The resulting SNR is Vo (20; |hil) . It can be shown (though the 
calculation is not trivial [14]) that EGC achieves a diversity gain of N,. At high SNR, in Rayleigh 
fading channels, EGC typically performs about 1dB worse than MRC and 1dB better than antenna 
selection. 


6.3 Transmit Diversity for MISO Systems 


Multiple transmit antennas can be used to send potentially different signals into the propagation 
environment. Unlike multiple receive antennas, with multiple transmit antennas, the signals are 
combined in the air by the channel. This subtle difference leads to substantial algorithmic changes in 
the system design. This section reviews approaches for exploiting transmit antennas both with and 
without channel state information at the transmitter in MISO systems. When properly designed, 
transmit diversity techniques can achieve diversity gains in Rayleigh fading channels similar to those 
achieved with receive diversity in SIMO systems. 


6.3.1 MISO Flat-Fading Channel Models 

The received signal model ina MISO system depends on the transmission strategy, for example, 
space-time coding or transmit beamforming. The channel for the MISO flat-fading case, though, can 
be represented similarly to the SIMO case, except as a 1 x N, vector given by h*. The conjugate 


transpose is used because all vectors in this book are column vectors; the conjugate transpose 1s used 
instead of the transpose just for convenience. With a transpose, all the channel models for SIMO flat 
fading in (6.2.1) also apply to the MISO case. 


Assuming that the antennas are colocated (in an area separated by a few wavelengths), each 
antenna sees the same large-scale component and only the small-scale coefficient differs. Then the 


channel may be decomposed as in the SIMO case as h = /Gyaimohs where in this case 
E 


asians Pr .tin(d) Nt N+. The extra factor of NV, comes from the scaling of the transmit signal by 


V Ex/ Nt and will be clear later. All the large-scale models for P,.,(d) can be used in the MISO 
case without change. The multi-antenna stochastic channel models for h, are similar to the SIMO 
case. There is a difference in terminology, though. The Rayleigh model with spatial correlation where 
h, has vector complex Gaussian distribution Ne(0, Rix) where Rex = E [hsb] js the transmit 
spatial correlation matrix. In the clustered channel model, 0, ,. becomes the angle of arrival (not the 
angle of departure) of the 7 ray to the c“ cluster. Now we proceed to explain the performance of 
different communication strategies in the MISO channel. 


6.3.2 Why Spatial Repetition Does Not Work 


The naive approach to using multiple transmit antennas is to send the same signal from all the 
antennas. This section explains the fallacy of this “intuitive” approach, as it breaks down in fading 
channels. 


Suppose that the same symbol s[7] is sent from each transmit signal. The pulse shaping is the same 
for all the transmitted signals. Assuming perfect synchronization and after matched filtering, the 
received signal is 


y[n] = hys|n] + hasi[n] + ---+hy,s[n] + vin]. (6.51) 


Collecting terms together, 


y|n| = s[n] (hy +ho+--++hy,) + vIn]. (6.52) 


N, 
At this point, it is tempting to be optimistic, because the effective channel here is the sum iat Mi, 
Unfortunately, the channel coefficients are complex numbers and the addition is not coherent. 


To expand further, we consider the IID Rayleigh fading channel where h; is IID Nc(0, Guo), 
Because sums of Gaussian random variables are Gaussians, (h; + hy +: - » + Ay.) is Gaussian with 


mean 


E (hy + ho BC hn, ) =.) (6.53) 


and variance 


E (ha - ho 5 ie hy.) | = E| h,|?] “ E| hg|?] tT: -Ellhy, |?) (6.54) 
= NtGuimo (6.55) 
= a (6.56) 
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Therefore, this sum channel (4, +h, +---+h Nn) is Vc(0,G). This means that (6.51) is equivalent 


(in the sense that the distribution of the effective channel is the same) to 


y(n] = hs[n] + v[n], (6.58) 


which is just the usual SISO fading channel. Consequently, there is no diversity if the symbol is 
repeated from multiple transmit antennas. This was illustrated in Figure 6.5, where it was shown that 
the effective post-processing SNR resulting from spatial repetition is no better than that from 
individual channels. 


The failure of spatial repetition to coherently combine in the channel encourages the development 
of other transmission techniques. In the next subsections, we introduce two such approaches. One 
main approach is transmit beamforming, where the objective is to adjust the phase of the signal sent 
on each antenna so that the resulting signals combine more coherently. The other approach is space- 


time coding, where the transmitted signals are encoded in such a way that the coded symbols 
experience many different combinations of the channel coefficients. Both approaches, when properly 
designed, lead to a diversity of NV, in Rayleigh fading channels. 


6.3.3 Transmit Beamforming 

Exploiting channel state information is one means to avoid the pitfalls in Section 6.3.2. One way to 
exploit channel state information at the transmitter is through transmit beamforming as illustrated in 
Figure 6.12. In this case, the same symbol is sent from each transmit antenna but is scaled by a 
complex weight /; that is allowed to depend on the channel state. The role off; is to change the phase 
and amplitude of the transmitted signal to better match the channel and lead to a good coherently 
combined over-the-air signal. 
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Figure 6.12 The operation of transmit beamforming. The weights f; may be determined based on 
the channel state. After multiplication by the weight, the resulting signal f;s[7] 1s passed through the 
usual pulse-shaping, discrete-to-continuous conversion, and RF operations. 


Define the beamforming weight vector as f’ = [f,, fi, .... ial . The received symbol, after 
matched filtering and synchronization, is 


y[n| = h*fs[n] + v[n]. (6.59) 


The effective received channel h*f is simply the inner product between h and f. To maintain the 


transmit power constraint, f is scaled such that Sak fj || s[n]|°] = Ne, which is achieved by 
beamforming vectors that satisfy |{f\? = N,. The choice of JN, is the result of our assumption that each 
antenna branch is scaled by E,/N,. Unlike the case of receive beamforming, the choice of the transmit 
beamforming vector does not impact the noise variance. 

As in the case of MRC, it makes sense to choose the beamformer f to maximize the received SNR. 
The objective is to maximize |h*f?/N, subject to if? = N,. The resulting beamformer is known as the 


maximum ratio transmission (MRT) solution. Using the same arguments as in Section 6.2.3, the 
optimizer has the form f = ah. To satisfy the power constraint with maximum power, observe that 


f*f = |a|*h*h (6.60) 
= MN. (6.61) 


Therefore, the scaling factor a must satisfy |a|? = N//| hi’. The simple solution is to take 
a= VN-/||bl tn this way, the transmit beamformer acts as a spatial matched filter to the channel 
and is given by f = VN h/|\h} 
The received SNR given MRT is 
snRMar _ [Tel (6.62) 
No 
N,|h*h|? 
~ [hI 
Nt 
No 
» Gummo 
No 


(6.63) 


= ||h||? (6.64) 


= ||h,|| (6.65) 
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= ||h,||?—. 
| ds | No 


(6.66) 


Comparing (6.40) and (6.64) reveals that the SNR is the same for both MRT and MRC! The 
performance analysis in Example 6.4 through Example 6.6 therefore also applies to the case of MRT. 
In particular, we can conclude that MRT offers a diversity order of N, in IID Rayleigh fading 


channels. 
Other constraints on the beamforming weights are also possible. For example, forcing each weight 


to have the form €XP(J9«)/V Nt would yield something equivalent to EGC, resulting in the equal 
gain transmission solution [205, 237]. This approach is attractive because it allows each transmit 
antenna to send a fixed fraction of the power £,/N, instead of being designed for full power £,. It is 


also possible to implement phase shifting in analog using digitally controlled phase shifters. Selecting 
the weight vector from the set S,., (scaled by V x) leads to transmit antenna selection [283, 146, 


228, 288]. Transmit antenna selection seems to be most interesting when the switching operation can 
be performed in analog, sharing one RF chain among many possible transmit antennas [167]. 


The main disadvantage of transmit beamforming based on the channel state 1s the requirement that h 
be known (perfectly) at the transmitter. This leads to developing strategies that make use of only 
information about h that is acquired from quantized channel state information sent from the receiver 
back to the transmitter. 


6.3.4 Limited Feedback Beamforming 


Limited feedback describes a methodology in wireless communication systems where the transmitter 
is informed about the channel state through a low-rate (or limited) feedback channel from the receiver 
back to the transmitter [211, 207]. 


Feedback channels are widely used in wireless communication systems to support many other 
system functions, including power control and packet acknowledgments. Because feedback channels 
consume system resources, their capacity is limited by design. This means that they must convey as 
little information as possible to accomplish the assigned task. 

The idea of limited feedback beamforming [208, 235, 207] is for the receiver to select the best 
possible transmit beamforming vector from a codebook of possible beamforming vectors. It turns out 
that this approach 1s more efficient than quantizing the channel directly because it leads to further 
feedback reductions. 


Let F= {f,, f,,..., fy, 3 denote a codebook of beamforming vectors. Each entry f; is an , x 1 


vector. All the vectors satisfy |jf;|| =, to maintain the power constraint. There 1s a total of N;p vectors 
in the codebook; typically N; 1s chosen to be a power of 2. In the case of MISO transmit 
beamforming, a reasonable approach to selecting a good transmit beamforming vector is to find the 
index of the vector that maximizes the received SNR: 


' 9 F ‘ 
= argmax ||h*£; ||* (6.67) 


- 
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j€{1,2,....Nie}4%o 


= argmax \|h*£;||?. (6.68) 


Essentially the best beamforming vector is f;*, which has the strongest inner product with the channel 
h. The receiver sends the index j* back to the transmitter through the limited feedback channel. 


The performance of limited feedback beamforming depends on the specific choice of the codebook 
F. Typically the codebook is designed ahead of time and specified in a standard. The most common 
design criterion is to find a codebook such that the minimum inner product is maximized, making the 
beamforming vectors far apart in some sense. This leads to a Grassmannian codebook (exploiting the 
connection between this beamforming problem and packings of lines on the Grassmann manifold 
[208, 235]). Other codebook designs are possible to satisfy other objectives, for example, making the 
codebook easier to search or reducing the precision required for its storage [358, 284, 150, 195]. To 
provide a concrete example, note that 3GPP LTE release 8 uses the following beamforming codebook 


for two antennas with Nj p= 4: 
oe ee ee 


Analyzing the performance of limited feedback beamforming in fading channels 1s difficult, since it 
requires the distribution of ||h*f;* ||, which depends on the codebook. It has been shown that in 
Rayleigh fading channels, a diversity order of NV, can be achieved for Grassmannian codebooks 
provided that N, p= N, [208, Theorem 4]. More detailed analysis of limited feedback beamforming in 


6.3.5 Reciprocity-Based Beamforming 


An alternative to limited feedback beamforming is to use reciprocity in the propagation environment. 
In the case of a MISO system, this involves deriving the transmit beamforming vector from the 
channel observed ina SIMO system. To explain, suppose that user A with N antennas used for both 
transmit and receive communicates with a user with one antenna for transmit and receive. Denote the 


SIMO channel as hp_, , and the MISO channel as hip . The principle of reciprocity 1s that the 
propagation channel from A to B is the same as the propagation channel from B to A. This means the 
path gains, phases, and delays are the same. Using our notation, this means that has = he_.,. 
Using reciprocity, user A can measure hp_, , based on training data sent from user B and can use this 


information to design the transmit beamformer f = hé_.4/||ba—all, of course, this requires the 
channel to be constant during the total time for transmission from B to A to B, called the ping-pong 
time. With reciprocity, channel state information from A to B is obtained for free from the channel 
measurement from B to A. 


Wireless systems that exploit reciprocity typically use time-division duplex systems where the 
same f, is used for transmission from A to B and from B to A. It may be possible to use different 


carriers if the frequencies are close together [199], but this is still an ongoing topic of research. 


The main advantages of reciprocity-based beamforming are (a) that no quantization of the channel 
or beamforming vector is required and (b) the training overhead is reduced since estimating the 
channel from J, transmit antennas takes approximately N, times more training than estimating the 


channel from one antenna. The main disadvantage is that a calibration phase 1s required because of 
differences in the analog front ends between the transmit and receive paths, especially differences 
between the power amplifier output impedance and the input impedance of the local noise amplifier 
[37]. 

There are two approaches for calibration: self-calibration [201, 202] and over-the-air calibration; 
see, for example, [305]. With self-calibration, an extra transceiver is used to perform the calibration 
[201, 202]. The extra transceiver is used to determine differences between the transmit and receive 
paths successively for each antenna during a special calibration transmission phase. In the over-the- 
air approach, the received channel is fed back to the transmitter, so that the transmitter can determine 
the required calibration parameters in the baseband [305]. Note that the required channel accuracy is 
quite high when performing calibration, versus what is required for limited feedback beamforming, 
but the calibration occurs less frequently (on the order of seconds, minutes, or longer). 


6.3.6 The Alamouti Code 


It is possible to obtain diversity from multiple transmit antennas without channel state information 
when the transmitted signal is specially designed. Such a construction is generally known as space- 
time coding [325, 326]. The Alamouti code [8] 1s perhaps the most elegant and commercially 
successful space-time code, and it appears in the WCDMA standard [153]. 


The Alamouti code is a space-time code designed for two transmit antennas. To simplify the 
exposition, we focus on the time instants 7 = 0 and n = 1. At time instant = 0, transmit antenna 1 
sends s[0] and transmit antenna 2 sends s[1]. The corresponding received signal 1s 


y[0] = (Ay s[0] + hes{1]) + v[0]. (6.70) 


At time instant 7 = 1, transmit antenna 1 sends (—s*[1]) and transmit antenna 2 sends s*[0]. The 


corresponding received signal is 


y(1] = (—Ais*[1] + hos*[0]) + v1]. (6.71) 


Taking the conjugate of (6.71) leads to an equation in the non-conjugated symbols: 


y™(1] = (—Ajs[1] + h3s[0]) + v* [1]. (6.72) 
Combining (6.70) and (6.72) into a vector input-output relationship gives 
0 h ; s|O v\0 gee 
eS ae ee le (6.73) 
y*(1| hs —hi} |s(1] v*{1] 
Ne A a ee 
¥ H s Vv 
or in matrix form 
y = Hs-+ v. (6.74) 
Multiplying both sides on the left by H* gives 
H*y = H"Hs+ H’v. (6.75) 


The resulting H has a special structure. Observe 


A> 


|hy|? i 


Agha — 


_ |ha|? + 
7 0 
The columns of H are orthogonal! 


Because of the structure of H, the filtered nois 
are complex Gaussian (because linear combinati 


E|H*v] = 0 and the covariance is 


E |H*vv*H] 


Therefore, the entries of H*v are IID with Ne(0 
simplifies as 


that 


ht ho} {hy he 
i 7 6.76 
hs 4 . —hi ea) 
lhg|? hihg —hjhe sce 
: : 6.77 
hyhs— |hy|? + |ho|? euttt) 
|ho|? 0) oe 
i 6.78 
Jhy|? + |hg|? ( ) 


e term H*v also has a special structure. The entries 
ons of Gaussians are Gaussian). The mean is 


H*E |vv*|H (6.79) 
N.H*H (6.80) 
No(\ha|? + |ha|?)I. (6.81) 


, No|hi|? + |ho|*) Given the structure of H, (6.75) 


H*y = (\hil|? + |he|?)s + H*v. (6.82) 


Because the noise terms are independent, the decoding of (6.82) can proceed independently on 
each entry (this property is called single-symbol decoding) without sacrificing performance. The 
symbols can then be found by solving the ML detection problems: 


§, = arg min | hy ho| y — (|hy|? + |he|7)s : (6.83) 

sEC 
$9 = arg min | hi -hy| y — (|hi|? + |he|7)s A (6.84) 

sEC 

To analyze the performance of the Alamouti transmit diversity technique, consider the post- 
processing SNR derived from (6.82) and (6.81): 
aarmAla  ({1|2 + |hal?)? = 
SNRAla = (fal! + [hal)” 6.85 
a No|hy|? + lhe? yea) 
l , 9 9 , 5" 
= AT (lal + |ha|*) (6.86) 
= Ne |[hs||? (6.87) 
No 

G 9 sews 
= i 6.88 
NoN, Ibs ( 


This SNR is the same as the post-processing SNR achieved with MRT in (6.64) with a factor of N, = 
2 in the denominator. Because 10 log), 2 ~ 3dB, the performance achieved by the Alamouti code is 


3dB worse than that achieved by MRT. The penalty is a result of not using channel state information at 
the transmitter and may be considered a fair trade-off. An illustration of the performance differences 
between the Alamouti code and MRT is provided in Figure 6.5. 


The average SNR and bounds on the probability of symbol error for the Alamouti code can be 
characterized using the derivations in Section 6.2.3. It achieves a diversity order of 2 in Rayleigh 
fading channels. Characterizations of performance in other fading channels are widely available [346, 
a, 245] . 

The Alamouti code achieves full diversity (2 when N, = 2 and N, = 1), sends two symbols in two 


time periods (called the full-rate property in space-time parlance), and has single-symbol 
decodability (the fact that the detection of each symbol can be performed separately without any loss 
of optimality as in (6.83) and (6.84)). Generalizations of the Alamouti code to larger numbers of 
antennas, for example, orthogonal designs [327] or quasi-orthogonal designs [163], sacrifice one or 
more of these properties. 


6.3.7 Space-Time Coding 


The Alamouti code is a special case of what is known as a space-time code [326]. A space-time code 
is a type of code that is usually designed to obtain diversity advantage from multiple transmit 
antennas. This section explains some general concepts of space-time codes and their performance 
analysis in IID Rayleigh fading channels using the pairwise error probability. 

A codebook is a generalization of a constellation. With a constellation, a sequence of B bits is 
mapped to one of the M = 2% complex symbols in the constellation C. In a general space-time code, 
the codebook takes the place of the constellation and codewords take the place of symbols. Ina 
space-time code, each codeword can be visualized as an N, < Nooge matrix where Nooge 18 the number 


of temporal symbol periods used by the code. Let us denote this codebook as S'and the k entry of the 
codebook as S;. We assume a normalization of the codewords such that E{tr(S;Sx)] = NtNeode, 


Example 6.7 
Determine the codebook for the Alamouti code when BPSK symbols are used. 
Answer: The general form of the Alamouti codeword is 


= S] —S5 
S= hi 
59 Sj 


where s, ands s, are constellation points. Ifs,; and s, come froma BPSK constellation, then there is a 


(6.89) 


total of four possible codewords. Enumerating each possible value of s, and s leads to 


Now consider the received signal assuming that codeword S = [s[0], s[1], . .- , SLNooge ~ 1]] 1s 
transmitted over a MISO flat-fading channel: 
y(n] = h*s[n] + v[n] nm =0,1,...,Neode — 1. (6.91) 


This can be written compactly in matrix form by collecting adjacent observations as columns 


Y=h*S+V (6.92) 


where Y and V are | x Nog. matrices. We use this approach because the derivations below extend to 
MIMO channels directly using H in place of h* and making Y and V N, = N,.qe matrices. 


An alternative vector form is achieved by stacking all the columns using the vec(Y) operator. The 
vec operator generates a vector by stacking the columns of a matrix on top of each other. The vec 
often shows up with the Kronecker product ®. The Kronecker product of an N x M matrix A and a P x 
O matrix B is 


aiBi ::- amB 
A®B= es (6.93) 
aniB ::- anuB 
A useful identity is the fact that 
vec(ABC) = (C’ @ A)vec(B). (6.94) 
Using these definitions to rewrite (6.92) gives 
= (Ivy, ®h*)s+v (6.95) 


where y = vec(Y) and s = vec(S). 

The maximum likelihood detector for a space-time code 1s derived in a similar way to the scalar 
decoder from Chapter 4, except using the multivariate Gaussian distribution from Chapter 3 and the 
vector equation in (6.95). A related derivation for spatial multiplexing is provided in Section 6.4.3, 
which can be applied directly to (6.95). The detailed derivation for the space-time-coded case 1s 
presented as a problem. The resulting detector is 


S = arg min |ly — (I ® h*)vec(Q)|| (6.96) 
QeES 


It can be rewritten more compactly using (6.92) as 


S = arg min |/Y — h*Q||%,.. (6.97) 
Qes 


The optimum detector searches over all possible transmitted codewords to find the one that was most 
likely transmitted. The complexity of the brute-force search over N.,ge entries can be reduced when 


there is structure in the code, such as orthogonal structure as found in the Alamouti code or trellis 
structure [326]. 


The performance of a space-time code does not just depend on the channel through the post- 
processing SNR as in the case of beamforming; it also depends on the spatial structure of the 


codebook. Given a realization of h, the conditional probability of codeword error P(Ex/No|b) can 
be upper bounded using similar arguments to those in Chapter 4 using the pairwise error probability. 
The detailed derivation is provided in the literature [326, 134]. The pairwise error probability, in 
this case taking G = E,, 1 


/ 
P(Qy > Qu|h) = Q CE. V/sarlla (Qe - 1 QI) | (6.98) 


Looking for the worst error event and inserting into the union bound gives 


P(E,./No|h) < (Neode — 1)Q (Vax a es _||h*(Qx — Qe)||2 2). (6.99) 
4%o0 “ek 


In the SISO case studied in Chapter 4, the upper bound depended only on the minimum distance of the 
constellation. The same 1s true for beamforming as seen in Example 6.6. In (6.99), however, the 
channel and the codewords are coupled together. This is a main difference between space-time 
coding and other diversity techniques. As a result, performance depends in particular on the structure 
and design of the codebook. 

The average probability of error can be used to devise criteria for good codebooks in fading 
channels. We consider the specific case of IID Rayleigh fading channels with Gyno = E,/N, to 
neglect the large-scale fading contribution. Taking the expectation of (6.98) and writing the Chernoff 
upper bound: 


Neode — 1 — —+— |i * / —O,;)I}? : 
En (P(Qi — Qe|h)] < “ Ey, le aN IIb” (Qu—-Q. | (6.100) 
Ne de 1 — — \( — | eg ’ 
me Er le a aalatial F (6.101) 


Let Ry ¢ = (Q — Q7)(Q; — Qz)*, which is called the error covariance matrix. Using the definition of 


the moment-generating function of a multivariate Gaussian distribution (related to the characteristic 
function; see [332] for more details), and substituting in for Gyo, it can be shown that 


CNesas = 1) 1 


Ep [P(Qs — Qelh)| 5 _ a) 
° 1 + tren; Re, 
Neode =u 1 ’ 
= ee (6.103) 
~ I eae 7 (1+ AN, N, THN; Am (R,e) ) 


(6.102) 
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where the second step uses the fact that the determinant of a positive definite matrix is the product of 
its eigenvalues and that the eigenvalues of a positive definite matrix of the form I+ R are 1 +A,,(R). 


The worst-case error is used to upper bound the average probability of error as 


/ r ' Neoc e~ i Neode =] , 
Ey, [P(E,./No|h)] < ( > ) max . <a ; = (6.104) 
7 j lat io (1 a AN, ! IN, No “Am (Re, ‘)) 
Neode —1 Neox e~ i! — 
~ (Ne > mi La rank(Riy) . (6.105) 
- - z et! eorank(Re,¢) ' 
(ax) Dom Am (Rc) 


Essentially, what can be gleaned from (6.105) is that the diversity performance of the space-time 
code depends on the worst-case error covariance rank(R, 7). A space-time code with a full-rank error 


covariance for all possible error pairs is known as a full-rank space-time code. The coding gain of 
rank(R,, .¢) / 
the code is related to the term | Im=1 Am(Rx.¢), For two codes of similar diversity 


performance, the one with the larger coding gain for that minimum-rank codeword will generally have 
better performance. 


Example 6.8 


Determine the diversity performance of the Alamouti space-time code using the pairwise error 
probability (PEP) approach. 


Answer: Evaluating the diversity performance using PEP requires computing the error covariance 
matrix. In general this is challenging, but for special codes like the Alamouti code the task can be 
accomplished. Because of the structure of the Alamouti code, using (6.89), the codeword difference 
has the form 


1 3") gi —(s" si) — 
Qt — Qe = 75, sf) 5 (g() _ Oye | . (6.106) 
Following a similar derivation to that in (6.78), 
Rae = 5 (Qk — Qe)(Qu — Qe)" (6.107) 
= : (Is — |? + lag — |?) 1. (6.108) 


Because rank(I) = 2, as long there 1s at least one error difference, the error covariance matrix is 
always full rank and the Alamouti code achieves a diversity of 2. 


The design of space-time codes has been an active area of interest. The problem is nontrivial since 
the properties of the code depend on different properties of the error covariance matrices. The rank 
and determinant criteria for code design were proposed in [326] where space-time trellis codes were 
derived. Delay diversity [357] (see Figure 6.4) turns out to be a special case in [326]. A variation of 
delay diversity called cyclic delay diversity [83, 133] 1s used in IEEE 802.11n. 


6.4 MIMO Transceiver Techniques 


MIMO communication makes full use of multiple transmit antennas and receive antennas. MIMO 
provides a generalization of both SIMO and MISO systems. It also introduces the spatial multiplexing 
mode of communication, where the objective is to send many symbols simultaneously, taking 
advantage of the higher capacity that is generally present in MIMO channels. In this section, we 
introduce the spatial multiplexing concept. We explain how to generalize maximum likelihood symbol 
detection and equalization to the MIMO flat-fading channel and provide some performance analysis 
for IID Rayleigh fading channels. Using knowledge of the channel at the transmitter, we show how to 
generalize transmit beamforming to the MIMO case, in what is called transmit precoding, and its 
extension to limited feedback. We develop channel estimators for MIMO flat fading, as detection, 
equalization, and precoding all require an estimate of the channel. Finally, we provide some 
extensions of time-domain equalization and channel estimation to MIMO frequency-selective 
channels. 


6.4.1 Spatial Multiplexing 


Spatial multiplexing is a communication technique where independent symbols are sent from each 
transmit antenna. A description of spatial multiplexing was provided in Section 6.1.3, emphasizing 
the application to two transmit antennas. Now we explain the extension to N, transmit antennas and 


describe how it performs with different receiver algorithms. 


An illustration of spatial multiplexing is provided in Figure 6.13. Consider a symbol stream s[”] as 
in earlier chapters. The spatial multiplexer operates as a | : N, serial-to-parallel converter. The 


outputs of the spatial multiplexer are the symbol streams denoted as s,[n] = s[Nn + (7 — 1)] forj=1,. 
.., N,. These symbols are pulse shaped and scaled for transmission to generate a signal 


Coo 


ft) = V N, >. s;[n]gtx(t — nT), (6.109) 


S;[n] = s[N,n] 
$5[n] = s[Nn + 1] 
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Figure 6.13 The operation of spatial multiplexing at the transmitter. Spatial multiplexing is 
essentially a serial-to-parallel conversion that takes symbols and produces subsymbols or streams. 
The output symbols are then pulse shaped, converted to continuous time, and upconverted. 


which is analogous to x(t) in (4.3) in Chapter 4. The only difference is the amplitude scaling. To keep 
the total transmitted power constant, the symbols must be scaled by V Ex/ Nt instead of by Vv Ex In 


N; / 9 
this way, do j=1 Ella; (t)/"] = Ex 


Let h; ; denote the complex baseband equivalent flat-fading channel between the j transmit antenna 


and the i” receive antenna. Suppose that this channel includes large-scale fading, small-scale fading, 
and thus the scaling by £,/N,. The baseband received signal, assuming perfect synchronization and 


sampling, is 
yi[n] = (hy181[n] + hy,2se[n] +--+ hin, sn, [nr]) + v;[n] (6.110) 


where v,[”] 1s the usual AWGN with Ne (0, No), Stacking all the received observations into a vector 


gives 


=I es os oo « Vl « tl ¢ (6.111) 
yn, [1] hy.a hn. ++: An... | | SN,[7] Un, [n] 
——————————SS SSS SSS 
y(n] H s(n] v[n] 
leading to the classic MIMO equation 
y(n| = Hs[n] + v{nl. (6.112) 


A simplified system block diagram of a MIMO communication system is provided in Figure 6.14. 
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Figure 6.14 A block diagram for a complete spatial multiplexing system including the transmitter 
and the receiver. The bits are mapped to symbols and then stacked into vectors at the transmitter. 
The entries of each vector are then transmitted separately on each transmit antenna. At the receiver, 
the outputs from each antenna may be synchronized separately, and then joint equalization and 
detection are used to generate an estimate of the transmitted symbol vector. The upsampling and 
downsampling are not shown explicitly in the figure. 


The receiver 1s tasked with detecting the symbol vector s[] from the observation y[n]. This can be 
performed based on different criteria as described in the next section. 


6.4.2 MIMO Flat-Fading Channel Models 
A MIMO flat-fading channel is represented by the Nx N, matrix H. Each column of [H].; 


corresponds to the SIMO channel experienced by the transmission from the j" transmit antenna. Each 
row of [H];. is likewise the MISO channel from the transmitter to the i” receive antenna. MIMO 
channel models therefore share similarities with SIMO and MISO models. 


Note that the dimensions of H are N, xN,. It is common, though, to refer to a MIMO system with N, 
transmit antennas and WN, receive antennas as an N, x N, system. For example, a 2 x 3 system has two 
transmit antennas and three receive antennas. 

Assuming that the antennas are colocated (in an area separated by a few wavelengths), the path loss 
between each transmit and receive antenna can be assumed to be the same factor P,;,,(d). As a result, 


all the large-scale path-loss models apply to the MIMO case as well without any changes. There are 
some further generalizations, for example, to LOS MIMO, distributed antennas, or cloud radio access 
networks, where antennas may have different geographic locations and a more general model might 
be required. 


Then the channel may be decomposed as in the SIMO and MISO cases as a product of aee scale 


and small-scale components as H = y ‘GyumoHs, As in the MISO case, GMIMo = F Tn Nt 
with P,,(d@) capturing the large-scale path-loss effects. The small-scale fading matrix H, captures all 
the effects of spatial selectivity in the channel. 

There are many stochastic models for MIMO communication systems, which treat H, as a random 
variable or construct H, as a function of random variables. Often we describe the distribution of H, 
in terms of h, = vec(H,), to leverage existing multivariate distributions (which apply to vectors and 
not matrices). 

The most widely used model for analysis of MIMO communication systems is the IID Rayleigh 
fading channel. In this case, [H,]; ; iS Ne (0, 1) and is IID. In other words, h, has a multivariate 
Gaussian distribution “Vc (9, Lv... ), The ID Rayleigh model is usually justified by the “rich 
scattering assumption” [115]. The main argument is that there are many scatters in the environment, 
which leads to Gaussianity by the central limit theorem, and the antennas are spaced far enough apart, 


which justifies the uncorrelated (and thus independent in a Gaussian channel) assumption. We use the 
IID Rayleigh fading channel for the analysis in this chapter. 


The most common generalization of the IID Rayleigh fading channel 1s to incorporate spatial 
correlation, in what is called the Kronecker channel model or separable correlation model. Let R,, 
denote the transmit spatial correlation matrix, let R. be the receive spatial correlation matrix, and let 
H,, be an IID Rayleigh fading matrix. With the Kronecker model, 


H, = R1/?H, Ri’. (6.113) 


The transmit correlation matrix captures the effects of scattering around the transmit antenna array, 
and the receive spatial correlation matrix captures scattering around the receive antenna array. This is 


Ff T « 
called the Kronecker channel model because the resulting h, is distributed as Nc (0, Rix @ Rrx), 


which gives a Kronecker structure to the spatial correlation matrix of the vectorized channel E|hsh?} 
Further generalizations are possible where the correlation is not restricted to having Kronecker 
structure [295, 350]. 


The clustered channel model can be used to separately generate the transmit and receive 
covariance matrices using, for example, closed form expressions like those found in [298] or [113]. 
Alternatively, the cluster parameters can be generated randomly, and H, can be simulated as a 


generalization of (6.12) as 
C-1R-1 


l : . 
H, = —— pipes Oey Wet Wee) 6.114) 
= JRC y, » a ( ) [> (@ ) ( 


“ c=0 r=0 


There are other generalizations that involve asymmetric numbers of clusters on the transmit and 
receive sides. A variation of this sum-of-rays-type channel model is used, for example, in the 3GPP 
spatial channel model. The approach based on the exact computation of the spatial correlation matrix 
is used in the IEEE 802.11n standard [96]. 


6.4.3 Detection and Equalization for Spatial Multiplexing 


In this section we review the maximum likelihood and zero-forcing (ZF) detectors and analyze their 
performance in IID Rayleigh fading channels. 


Maximum Likelihood Detector for Spatial Multiplexing 
Suppose that all the symbols s,[n] come from the same constellation C. It is possible to generalize to 


multiple constellations (though uncommon in practice) as explored in the problems at the end of this 
chapter. The constellation formed by the resulting s[7] is called the vector constellation. It consists of 
all the vectors with every possible symbol and is denoted as S. Essentially, S= Cx C--- x C. The 
cardinality of S is 


IS| = |e|**. (6.115) 


The size of the vector constellation thus grows exponentially with N,, leading to very large 
constellations! 


Example 6.9 
Illustrate the vector constellation for V, = 2 and BPSK modulation. 


Answer: With BPSK, C= {—1, 1}. Forming all combinations of symbols, 


(CGE 


ES 


The maximum likelihood detection problem is to determine the best s|n| that maximizes the 
likelihood of y[7] given a candidate vector symbol S[”| and H. The derivation is similar to that in 


Chapter 4. First note that v[7] is multivariate Gaussian with distribution Ne (0, Nol), Therefore, the 
likelihood function is given by (using (3.300)) 


fyjti.s(y[n]|s[n] = 8,H) = ——e lB), (6.117) 


The maximum likelihood detector solves 


sin] = arg max fyjH_s(y[n||s[n] = 5, H) (6.118) 
ses 
1 = s _H)*(yin)— 
= arg max——_€ a Ovi] —Eia)" (y[re] =). (6.119) 
sceS al No . 


ae 
Since the 7‘: N.** does not impact the maximizer, and recalling that the exponential function is 


monotonically increasing, 


s[n] = arg max — (y|n] — Hs)* (y[{n] — Hs) (6.120) 
seS +‘o 
= arg min (y(n] — Hs)” (y{n] — Hs) (6.121) 
seES 


Rewriting the inner product in (6.121) as a norm gives the main result: 


s|n| = ar arg Ii min ||y[n] — Hs|\*. (6.122 


‘ , 


Based on (6.122), the ML decoder for spatial multiplexing performs a brute-force search over all 
possible vector symbols |C}‘t. Each step involves a matrix multiplication, vector difference, and 
vector norm operation. The complexity can be reduced somewhat if the channel remains constant over 
many symbol periods. Then a distorted vector constellation can be computed ahead of time for the 
entire block by precomputing Hs for all s © S and using this constellation to avoid the product in 
(6.122). The disadvantage is the increase in storage required. There are several low-complexity 
algorithms that approximate the solution to (6.122), including the sphere decoder [140, 347, 24]. 


The performance of spatial multiplexing can be evaluated using pairwise error probability in a 
similar way as is performed for space-time codes in Section 6.3.7. First we consider the case of 
AWGN. For this performance analysis we consider just small-scale fading and let G = E,/N,. Let s 


€ § denote the codeword transmitted, and let s“ denote the codeword that is decoded. Assuming the 
ML decoder is used, the pairwise error probability in the AWGN channel is 


| Ey || Hs(*) - Hs‘) ||2 


P(s™ — s©|/H) =Q | 4) ON.N, ; (6.123) 
The probability of vector symbol error is then bounded as 
| EE, 7 
N |) < (|S| = 1) ) ae Q (y ON.N, \|Hs(*) — Hs?) (6.124) 
woah | ol 
< (|S| —1)Q ( ON. min, || Hs(*) — Hs?) (6.125) 


The performance of spatial multiplexing depends on the minimum distance of the distorted vector 
constellation given by min, ¢ x z¢ Hs = Hs. In a SISO system, the minimum distance is a 


function only of the constellation; in the MIMO case, it is also a function of the channel. The channel 
distorts the vector constellation and changes the distance properties. Having performance that 
depends on the channel realization makes performance analysis more challenging since it is not 
simply a function of a post-processing SNR. 


To see how the channel impacts performance, let e“ = s“ — s. Then the error term becomes 
Hs — Hs ||? = ||He™ |? (6.126) 


If the channel H is low rank, then it is possible that one of the error vectors lies in the null space of 
H, which would make \He® OP = (). This makes the upper bound equal to (|S| — 1)0.5. Alternatively, 
suppose that the channel is a scaled unitary matrix that satisfies H*H = cI. Then, because of unitary 
invariance, it follows that \He® 9? = a le“ OP and good performance is guaranteed, as long as the 
error vector is nonzero. In this case, the channel rotates all the error vectors by the same amount, 
preserving their distance properties. 

Now we compute an upper bound on the probability of error for spatial multiplexing with a 
maximum likelihood receiver in an IID Rayleigh fading channel. To solve this using a similar 
calculation to that in (6.100) it is useful to rewrite the error term as 


He ||? — |lvec(He* )||? (6.127) 
where the last step follows from the fact that for a vector x, vec(x) = x. Using the identity in (6.94), 
He ||? = |\(e*9 @ In, )vec(H)||?. (6.128) 
Defining h = vec(H), and expanding the inner product and using the distributive property of the 
Kronecker product, 


||He*-® ||? = h*(e\®e®9 T @ Ty yh. (6.129) 


Let the error covariance matrix for spatial multiplexing be defined as R, = ee“ 7, Following 
the same steps as in (6.100)—(6.102): 


- 71 1 oe 

Ey |P(s > s\|H) <5 (6.130) 
211+ af Rie @Iy, | 

1 1 a 

a (6.131) 
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The last step follows from two facts. First, rank(R, ¢) = 1 and its only nonzero eigenvalue is ye OP. 
Second, the eigenvalues of A ® B are A,(A) ,,(B). Finally, the smallest value of je“)? is the dyin Of 
the component constellation C. 


The main intuition derived from (6.131) is that the spatial multiplexing with a maximum likelihood 
receiver obtains at most a diversity gain of N.. There is no diversity obtained from having multiple 


transmit antennas. The reason is that the information sent across each transmit antenna 1s effectively 


independent since the constituent symbols s[n] are all IID. Since there is no redundancy in space, the 
rank of the error covariance matrix R, ¢ is 1. Itis possible to derive codes that combine space-time 


coding and spatial multiplexing to achieve a diversity as high as N,N, [149, 214, 31]. 


Spatial multiplexing with a maximum likelihood receiver is usually the benchmark receiver for 
performance comparisons. In Rayleigh fading channels it achieves a diversity order of N,, which is 


the maximum that can be achieved by lower-complexity receivers (without channel state information 
from the transmitter or more sophisticated kinds of space-time coding). The main drawback of the 
maximum likelihood receiver is the requirement for a brute-force search over all the transmitted 
symbol vectors. This leads to the study of other low-complexity receiver techniques like the zero- 
forcing receiver. 


Zero-Forcing Detector for Spatial Multiplexing 

The zero-forcing detector is one of the simplest yet most effective receivers for a spatial multiplexing 
system. Consider the system in (6.112). Now suppose that H is full rank and that NV. > N,. From 
Chapter 3, the least squares estimate is 


S[n] = (H*H)~*H*y([n] (6.132) 


where (H*H) 'H* is simply the pseudo-inverse of H given by H’. 
In the presence of noise, the equalized received signal is 


The zero-forcing receiver detects each symbol stream separately, neglecting the fact that the noise is 


now spatially correlated. Let z = H'y[n]. The zero-forcing detector then computes for j= 1,2,..., 
M 


3,(n] = arg min |z,[n] — ¢| (6.134) 
ceEc 

separately for each entry. In terms of search complexity, the zero-forcing detector computes JN, 

searches over the |C| constellation symbols, computing a scalar magnitude operation for each entry. In 


contrast, the maximum likelihood detector computes a search over |C\‘ vector symbols and has to 
compute a difference and norm of length NV, for each entry. The complexity for zero forcing is much 


reduced compared with that of the maximum likelihood solution. 


Performance of the zero-forcing detector can be analyzed through the probability of stream symbol 
error. A vector symbol error occurs when there is one or more stream symbol errors. The post- 


processing SNR for the j" symbol stream can be computed from 
z;[n] = s;[n] + [H'v[n]];. (6.135) 


The noise is zero-mean complex Gaussian with covariance matrix N,H"H"*. Using the pseudo- 
inverse definition, 


H'H! * = (H*H)~'H*H(H*H)~! (6.136) 


= (H*H)~'. (6.137) 
Therefore, the variance of [H'v[n]] |; 1S [(H*H) |] |; and the post-processing SNR is given by 


1 


— (6.138) 
No|(H*H)-*|33 | 


SNRZF = 


This shows that the performance of the ZF receiver depends on the “invertibility” of the channel. If 
the channel was ill conditioned, then the inverse would “blow up,” creating a very small post- 
processing SNR. 


Example 6.10 
Consider a 2 x 2 MIMO system with channel matrices given as follows: 


el T / 3 el T / 2 
A = | jx/7 (6.139) 


n/7 en jr/4 


eit/6 05a)" 
Hn | | ' | (6.140) 


eit/5 0.1 ¢i7/8 


First, compute the zero-forcing equalizers for the two channels, and after that calculate the post- 
processing SNR for each channel, assuming 1/N, = 10dB. 


Answer: The ZF equalizer is just the pseudo-inverse of the channel matrix, which is the same as 
the normal inverse for the given 2 x 2 matrices. For the first channel matrix, 


1 p—jn/4 = _ejn/2 
Hy = Gasenin/t — einen P2 | . | (6.141) 


el Tv / 3 e — j Tv / 4 mes e) Tv / fi el Tv / 2 a—3 el 7T / 7 PA Tv 4 3 
| e ae 4 sont el T / 2 : 
= ie ... Aone enti ‘7/3 . (6.142) 
elt /he — e)9T/ —e™ el” 
Similarly, 
7 1 0.1e)7/8 0.5 : 
H,' = —_; = aie. seein (6.143) 
. 0.1ei77/24 4 0. 5ei7/5 | —e)7/9 elm /6 


The post-processing SNR for the /" stream is given by (6.138). Plugging in N, = 2 and 1/N, = 10dB, 
for H,, 


semen 11.8591 10.74dB 
SNRZF(H,) = _ 6.144 
(Hi) ro re ( ) 
he ‘ — = 7 140 
. 1.7794 2.50dB 


The probability of error inan AWGN channel is obtained by inserting the post-processing SNR in 
(6.138) into the expressions in Section 4.4.5. Analyzing the average probability of error is more 
complicated. It can be shown for Rayleigh fading channels that (6.138) is a (scaled) chi-squared 
random variable with 2(N, — N, + 1) degrees of freedom [132]. This result can be used to show that 


oN Pp ZF 
SNR" has an equivalent distribution as a SIMO system with N, —N, + 1 antennas but with G 
replaced by Gyo. AS a result, the diversity advantage is NV. — N,+ 1. If V,= MN, then all the 


dimensions of the channel are used in the inverse and there is no “excess” for diversity advantage. 
Interestingly, if there is even just one excess antenna NV. = N, + 1, then zero forcing achieves second- 


order diversity. The diversity from just a single extra antenna is shared by all the symbol streams to 
improve performance. Because diversity effects offer dimensioning returns, a ZF receiver with a few 
excess antennas can perform close to the maximum likelihood receiver despite its much lower 
complexity. 

The performance analysis results for spatial multiplexing show that neither maximum likelihood 
nor zero forcing benefits from having an excess number of transmit antennas. With ML detection, the 
diversity advantage depends only on N,. With the ZF receiver, the diversity gain is reduced if large N, 


is chosen and N, > N, is not feasible. Additional gains from multiple transmit antennas can be 
obtained by using, for example, transmit precoding. 


6.4.4 Linear Precoding 

It is possible to exploit channel state information at the transmitter in MIMO spatial multiplexing 
systems using linear transmit precoding. A linear precoder is a matrix that maps N, streams onto N, 
transmit antennas. Most applications of linear precoding focus on the case where N, < N, and use 
channel state information to design the precoder. The special case of N, = 1 corresponds to sending a 
single stream using beamforming. 

A spatial multiplexing system with linear precoding is illustrated in Figure 6.15. Let F denote the 
N,  N, precoding matrix. The precoding matrix is applied to the NV, x 1 symbol vector s[n]. There are 
different ways to normalize the precoder F. In this section, we consider the case where tr(FF*) = N, 
to match our assumptions earlier where the scaling per antenna branch is E£,/N,. Under this 
assumption, the classed MIMO received signal equation is 


y(n] = HFs/[n] + vin]. (6.146) 
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Figure 6.15 Spatial multiplexing at the transmitter with linear precoding. The spatial multiplexer 
produces a symbol vector of dimension N, = 1. The precoding operation multiplies the symbol 


vector by a precoding matrix F, usually determined by the channel state, to produce the precoded 
output. The outputs are subsequently pulse shaped, converted to continuous time, and upconverted. 


Other normalizations of the precoder are also possible if, for example, different powers are allocated 
to the different symbols [71, 370]. Additional constraints might also be imposed, for example, a per- 
antenna power constraint [71 ]. 


A special case of linear precoding is known generally as e1genbeamforming. To inform the 
construction, it is useful to have some background on the singular value decomposition (SVD) of a 


matrix. Any nonsquare N x M matrix A can be factored as 


A = Urai=Vi- (6.147) 


The matrix U,, is unitary NV x N, V_,, 18 unitary M = M, and X is an N x M matrix with ordered (from 
largest to smallest) nonnegative entries {a,} (called singular values) on the main diagonal and zero 
elsewhere. The columns of Us, are called the left singular vectors of A and the columns of V;,y, are 


called the right singular vectors of A. The squared singular values of A are the eigenvalues of AA* 
and A*A. 


Suppose that the SVD of H = UXV™. Let F=[V] LN, Substituting into (6.146), 
y|[n| = ULV" [V]..1:~,s[n] + vin]. (6.148) 


Multiplying the received signal vector by U* gives 


U*y[n] = USUSV"[V].1-v,s[n] + U*v[n] (6.149) 
= DV" |V].1-v,s[n] + U* vin] (6.150) 
= [3S]: 1:v,s[n] + U*v[n] (6.151) 


where (6.150) follows because U is unitary; and (6.151) follows because V is unitary; thus V* 


[Vv] SN, = [T] LN, Also because U is unitary, U*v[1] remains Nc(0, NoD) since 


E[U*v|n}v*[n]U] = Nol Unlike the zero-forcing case, the noise remains IID. Defining z[n] = 
U*y[n] and using v[7] in place of U*v[] since they have the same distribution gives the equivalent 
received signal 


z[n| = [S]-.1:v,s[n] + v[n]. (6.152) 


It is easy to decode the i” stream because all the symbols are decoupled: 


Zj n = o;8;|n| + Vj [n]. (6. 153) 
Unlike the case of zero forcing, there is no loss in performing the stream detection independently 
since the noise remains IID in this case. Consequently, the performance of the optimum detector is 
dramatically simplified. 

Linear precoding as described in this section has a nice interpretation. Effectively with the chosen 
precoder each symbol rides along its own eigenmode (singular vector) of the channel using a 
beamforming vector f; = [F].; chosen to excite that mode. Linear precoding effectively couples 


symbols into the MIMO channel along its preferred “directions.” 
The performance of the j" stream depends on the post-processing SNR 
1 4 


SNR = 57-9}; (6.154) 


which depends only on the singular values of the channel. If V, = N,, linear precoding provides 


performance that is similar to that of the zero-forcing decoder. Precoding, though, can be tuned to give 
better performance by adapting the rate and power per stream. The benefits of precoding appear when 
used with NV, < N,. For example, consider a 4 x 4 channel H that has only two significant singular 
values. By choosing N, = 2, the transmitter can send information on the two best eigenmodes and can 
avoid wasting effort on the two bad ones. When JN, is chosen dynamically as the channel changes, this 
is known as rank adaptation or multimode precoding [206, 148, 179]. 

The advantages of precoding are also realized in terms of its diversity performance. If N, < N, and 
N, < N,, then in Rayleigh fading channels precoding achieves a diversity order of (NV, — N, + 1)(V,- 
Ne 1) (210), 


In this case diversity is obtained both from excess transmit and excess receive antennas. If 
multimode precoding is employed, then a diversity of N,N, can be achieved, which is what is 
achieved using single-stream beamforming. Overall, precoding makes use of the channel to provide 
better performance, make use of extra transmit and receive antennas, and allow optimal low- 
complexity decoding at the receiver. 


Requiring channel state information at the transmitter is a main limitation of linear precoding as 
described in this section. A solution to this problem is to use limited feedback precoding, where the 
transmit precoder is selected from a codebook of possible precoders known to the receiver and the 
transmitter. 


6.4.5 Extensions to Limited Feedback 


Limited feedback precoding is a generalization of limited feedback beamforming. It provides a 
framework for providing channel state information (in the form of a codeword recommendation) from 
the receiver to the transmitter. Limited feedback precoding is found in IEEE 802.11n and 3GPP LTE, 
among other wireless systems. 


A spatial multiplexing system with limited feedback is illustrated in Figure 6.16. Let P= {F), F, .. 
. » Fy,,,§ denote a codebook of precoding matrices. Each entry F; 1s an N, x N, precoding matrix that 
satisfies the appropriate precoding design constraints (in our case it satisfies the constraint that the 


columns be unit norm and orthogonal). 
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Figure 6.16 Spatial multiplexing at the transmitter with limited feedback linear precoding. The 
spatial multiplexer produces a symbol vector of dimension N, x 1. The precoding operation 
multiplies the symbol vector by a precoding matrix F, which comes from a codebook of precoding 
vectors. An index sent from the receiver determines which precoder in the codebook is selected. 


Selecting the optimum codeword from the codebook depends on the chosen receiver design. There 
are different criteria, and each is reasonable under different assumptions [210]. For example, if a 
zero-forcing receiver is employed, one approach would be to maximize the minimum post-processing 
SNR given the composite channel HF’, as 


Ex 

= eremiaex min a (6.155) 
k=1,2.,...,NLE m=1,2,...,Ne NiNo|(F;H*HF;) - 5 fee : 

= argmin max  |(F,H*HF;) — (6.156) 
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Another approach, inspired by (6.154), is to maximize the minimum singular value of HF,. 


The diversity obtained using limited feedback precoding is the same as that achieved with perfect 
channel state information if the codebook is large enough. With a good design, choosing N; ¢ = N, 1s 


all that is required [210]. 


Antenna subset selection is a special case of limited feedback precoding [146, 228, 207]. With 


antenna subset selection, the precoding matrices are chosen from subsets of columns of the identity 
Ne , 

matrix. The maximum codebook size is ( Ns ) Antenna subset selection can also be viewed as a 

generalization of MISO antenna selection. 


Example 6.11 
Illustrate the limited feedback codebook derived from antenna subset selection, assuming N, = 2 and 
N= 2: 


3\ _« 
Answer: There is a total of (3) = 3 entries in this codebook. The resulting codebook 


1 O i i o 
01 ‘Al 5 0 0 iV i @| >. (6.157) 
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The scaling factor is chosen to ensure that **(F; Px) = Nz. 


There are a number of different possible codebook designs for F’. Perhaps the classic design is 
Grassmannian precoding [210] where the codewords correspond to points on the Grassmann 
manifold. The 3GPP LTE standard uses a nested codebook design that allows the transmitter to select 
a smaller N, than indicated by the receiver [127, Section 7.2.2]. IEEE 802.11n has three feedback 
options: one where the receiver feeds back the quantized channel coefficients directly (full channel 
state information feedback) and two where the receiver first computes beamforming feedback 
matrices and sends the noncompressed version (noncompressed beamforming matrix feedback) or the 
compressed version using Givens rotation (compressed beamforming matrix feedback) [158, Section 
20:3.,12.2 |. 


Example 6.12 


In this example, we compare the performance of different MIMO configurations and performance 
with two transmit antennas and two receive antennas. To make comparisons fair, we keep the rate 
(size of the vector constellation) fixed within each strategy. We compute the average probability of 
symbol error rate assuming an IID Rayleigh channel. We consider four possible cases: 
¢ SIMO with maximum ratio combining (only one transmit antenna is used) with a 16-QAM 
constellation. 
¢ MIMO with transmit and receive antenna selection (select the transmit and receive antennas 
with the best channel) with a 16-QAM constellation. Antenna selection is a special case of 
limited feedback precoding. 
¢ Spatial multiplexing with a zero-forcing receiver and 4-QAM constellations. 
¢ Spatial multiplexing with a maximum likelihood receiver and 4-QAM constellations. 


Answer the following questions based on the performance plot in Figure 6.17. 


- @- Antenna selection — 16-QAM 
|~ @-Zero forcing — 4-QAM 
| - 8 - Maximum likelihood — 4-QAM 


|-™- Optimum combining — 16-QAM 
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Figure 6.17 Symbol error probability comparison for different 2 x 2 MIMO techniques 


¢ Why does antenna selection in the MIMO channel outperform optimum combining in the SIMO 
channel? 


Answer: In the 1 x 2 SIMO channel, there is only second-order diversity. In contrast, since 
there are two transmit antennas and two receive antennas in a 2 x 2 MIMO channel, antenna 
selection provides a diversity advantage of four, which can be seen through similar arguments 
to those for antenna selection in Section 6.2.2. Therefore, at high SNR, the performance curves 
cross and the system with higher diversity has better performance. 

¢ For what SNRs is the antenna selection technique preferred to spatial multiplexing with a zero- 
forcing receiver? 
Answer: Antenna selection has both transmit and receive diversity with a diversity advantage 
of four, whereas spatial multiplexing with a zero-forcing receiver has a diversity advantage of 
only one. From Figure 6.17, antenna selection outperforms zero forcing for all the SNR values 
considered. 


¢ For what SNRs is the antenna selection technique preferred to spatial multiplexing with an ML 


receiver? 


Answer: Figure 6.17 shows that the antenna selection technique 1s preferred to spatial 
multiplexing with an ML receiver when SNR is (approximately) greater than 14.5dB. When the 
SNR is smaller than 14.5dB, spatial multiplexing with an ML receiver is preferred over antenna 
selection. 


¢ What is the performance difference between zero forcing and the maximum likelihood receiver? 


Answer: The maximum likelihood receiver is better than the zero-forcing receiver in terms of 
symbol error rate for all SNR values because of its higher diversity advantage. The difference 
between ML and ZF increases as SNR increases. 


¢ What is the complexity difference between zero forcing and the maximum likelihood receiver? 


Answer: First, there are some calculations that are performed on the block of symbols. Zero 
forcing requires the calculation of a matrix inverse. Maximum likelihood does not require 
initial calculations, but more efficient implementation uses a modified constellation {Hs|s © 
S'. The complexity is essentially then one matrix inverse versus 16 matrix multiplies. 


Second, with zero forcing, two independent detections of 4-QAM require one matrix multiply 
(requiring four multiplies and two adds) and eight scalar minimum distance calculations. With 
maximum likelihood detection, the joint symbol detection requires 16 vector minimum distance 
calculations, which is equivalent to 32 scalar minimum distance calculations. The complexity 
with maximum likelihood detection is at least twice as high as for zero forcing, not including 
the first set of calculations. 


6.4.6 Channel Estimation in MIMO Systems 


All of the SIMO, MISO, and MIMO approaches described thus far require knowledge of the channel 
at the receiver. This section provides some generalizations of the channel estimators described in 
Section 5.1.4 to estimate the MIMO channel. Through suitable choice of N, and N,, these approaches 


can be applied to estimate SIMO and MISO channels. 


Let {tl} | denote a vector of training symbols. We take the vector of training symbols and 
create a training matrix to simplify the exposition: 


T=|t/0] +[1] -- t[Nee — 1]. (6.158) 


The simplest approach for channel estimation is to take turns training each antenna. Suppose that 
N,, 18 divisible by N,, and let {t(n] ase — denote a scalar training sequence. Sending this 
sequence successively on each antenna gives 

tO] --- ¢]M,—1] O -:-: ) 
T=|0 --- 0 tO] --- t{Mr—1] ++], (6.159) 


wes 0 


Consider the received signal during the transmission of the training sequence on transmit antenna / 
at receive antenna i: 


yiln +(¢—1) Ne] = hggt[n| + viln + (i — 1) Ne] (6.160) 
forn=0,1,..., N/N,— 1. Stacking into a vector gives 
Vig = hy gt + Vij. (6.161) 
The least squares solution is 
hag = (t*t)~'t*ymp. (6.162) 


When the training signal is sent from transmit antenna /, the signal received on antenna 7 1s used to 
estimate h, ;. After cycling through all the transmit antennas, the receiver can put together an estimate 


of the entire channel matrix H. 


The problem with this simple approach 1s that it requires transmit antennas to power off and on. 
This may create challenges for the analog front end. It also results in reduced performance when there 
are per-antenna power constraints (when all the transmit antennas are not used, not all the power is 
delivered). A solution is to use a more general training structure. Suppose that a general T is used 
with columns t[]. Then 


y[n] = Ht|[n] + v[n]. (6.163) 
Now, collecting successive vectors together forn =0,1,...,M,—1, 
Y=HT+V. (6.164) 
The least squares solution for H is 
H = YT*(TT")—!. (6.165) 


Further details about this derivation are provided in Example 6.13 and Example 6.14. 


A good design for the training sequence is to have orthogonal sequences sent on each antenna. If 
this is the case, then TT* = N,,I and the least squares channel estimate simplifies as 


H = YT’, (6.166) 


which is essentially a matrix correlation operation. 


Example 6.13 


Consider a MISO frequency-flat slow-fading channel with two transmit antennas and one receive 
antenna. Let h; ; and h, 4 be the channels from the transmit antennas to the receive antenna. Assuming 


that the independent symbols s,[n] and s,[n] are transmitted, the received signal at this antenna is 
given by 


y(n] = hy 81[n] + hy,282/n] + v1 [n], (6.167) 


where v,[n] is the noise at the receive antenna. 
Formulate and solve a least squares channel estimation problem to estimate the channel 
Ner-1 
coefficients of the MISO channel, assuming that we send two training sequences {ti[r]}n=o and 


Nter-1 ; . : 
{t2|n]},“o through the MISO channel at the same time. Be specific about the expression of the 
squared error and the resulting matrix and vector definitions, to provide a point of comparison for the 
MIMO case in the next example. 


Answer: The input-output relationship in the matrix form in the absence of noise is given by 


: t}(O] ¢,[1] +--+ ti[Me—1] ee 
yil0] ++ aifNe i] = [roa aa] fe 6.168 
Ly | yi LN tia 01,2 ia(0] doll] «~~ dalam‘) (6.168) 
< 
yi h{ T 
We write using row vectors to be consistent with the derivation in (6.164). The squared error 
between the observations and the known data for dummy vector a! = [ay, a] is 
Ni rr iT 
ey ». lya(n] — ayty[n] — agta[n]|? (6.169) 
n=0 
= ||yi — Tal’. (6.170) 
The least squares channel estimate is the solution to 
h; = arg min ||y; — T’ all”. (6.171) 
acC?2x! 
Assuming that T is full rank, the least squares MISO channel estimate is 
i =a aor) (6.172) 


where we have written the final solution in the form as in (6.165). 


Example 6.14 


Consider the system in Example 6.13 where the receiver is equipped with one additional antenna to 
forma MIMO 2 x 2 system. The channels from the transmit antennas to the second receive antenna are 


denoted as />, and h»5. The received signal at the second receive antenna is given by 
y2|[n| = ha1si[n] + he,282[n] + ve[n], (6.173) 


where v,[7] is the noise at the second receive antenna. Formulate and solve for the least squares 
1 Ner—1 nl Ner— 
channel estimator using the same two training sequences {ti[r]}n“0 and {t2["]}n=o , chosen such 


that T is full rank. 


Answer: We define y, ina similar fashion as y,, and h, ina similar way as h,. The input-output 
relationship in the matrix form in the absence of noise is given by 


yi[0] --- yilNer —1J) _ Aaa Aaa} [ti[0] tif[1] --- fl ‘tr salt) (6.174) 
y2|0) oes yo| Mer = 1] t2{0] t(1| vee to| Mer —] 


¥ H T 


hi2 hop 


or equivalently 


T. (6.175) 


Now consider the squared error term, which in the matrix case is written using the Frobenius norm 
. ; 
|All a ya yy lAli; as as 


2 


T ht . T ht a T me 
a= |e) tel | al ae Ah] aol ae (6.176) 

¥2 hy rp Y2 h5 Y2 h5 
= |ly: — T™hy||? + |ly2 — Tho]? (6.177) 


Because the cost function is separable (the minimizer of the first term is independent of the minimizer 
of the second term), it is possible to solve for each channel separately as 


h; = (T°T’)-!Ty,, for i = 1,2. (6.178) 
Equivalently, 
h; = arg min ||y; — Ta;||’, for i = 1,2. (6.179) 
a;E€C?2%!1 


Assume that T is full rank. The MIMO least squares channel estimate is given by 
h; = (T°TT)-!T°y;, for i = 1,2. (6.180) 


Writing in matrix form using (6.165), then 
H = YT*(TT*)"'. (6.181) 


In general, for a given SISO channel estimation error achieved with a length Ngjso training 
sequence, a MIMO system requires a length N,Vsjs9 matrix sequence to achieve the same estimation 


error performance. The overhead is only a function of the number of transmit antennas. For a given 
coherence time, the number of symbols during which the channel is constant is fixed. As the number of 
transmit antennas increases, the required training grows accordingly. For a large number of antennas, 
little time may be left to actually send data. This creates a trade-off between different system 
parameters where there is tension between the number of transmit antennas used, the time devoted to 
training, and the number of receive antennas. 


6.4.7 Going Beyond the Flat-Fading Channel to Frequency-Selective Channels 


The exposition in this section has focused so far on MIMO communication in flat-fading channels. 
MIMO communication is also possible in frequency-selective channels. This section reviews the 
MIMO frequency-selective channel model and describes how concepts like equalization are 
generalized in the MIMO setting. Emphasis is placed on spatial multiplexing. At present, commercial 
systems employing MIMO communication in frequency-selective channels use MIMO-OFDM, which 
is explored in more detail in Section 6.5. 


Frequency-Selective Channel Equalization 


VAL 
Let i. A) }eL0 denote the complex baseband discrete-time equivalent channel between the 


transmit antenna and the i“ receive antenna. The channel includes all scaling factors. Spatial 
multiplexing 1s used at the transmitter where the symbol stream {s;[n]} 1s sent on antenna j. The 


discrete-time baseband received signal (assuming synchronization has been performed) 1s 


L 
y(n] = ». hi 1[é]si[n — €] + hy2[élse[n — 4] +--+ + hin, [Alsn,[n — 4]. (6.182) 
é=0 
Compared with the SISO channel model in (5.87), in the MIMO case, each receive antenna observes 
a linear combination of all the signals from the transmit antennas, filtered by their respective 


antennas. This self-interference among the transmitted signals makes it hard to apply directly all the 
algorithms from Chapter 5 without substantial modifications. 


ray L f __ i . 
Define the multivariate impulse response {H\4) }¢0 where HHI4\)i,5 = i541 Then the received 
signal can be written in matrix form as 


y(n] = 4° Hfdjs[n — 4] + v[n). (6.183) 


This is the standard MIMO input-output equation for a frequency-selective fading channel. 
Because the channel is a multivariate filter, a linear equalization should also be a multivariate 
(Ry a 
filter. Let {G{k]} k—o denote an order K multivariate impulse response that corresponds to the 


i JK 
equalizer. A least squares equalizer would find a {G[k]} k=0 such that 


K 
>. G[k]H[n — k] = 6[n — ngJI. (6.184) 
k=0 


The least squares equalizer can be found using a similar approach to that in Section 5.2.2 using block 
Toeplitz matrices. Let Hxk.m denote the (L+ K+ 1) x (K+ 1) Toeplitz matrix constructed from 
hy nln] as in (5.91). Furthermore, let g; = [2% 9], Seml ll, .--, 2 mM KIN . Then the equalizer for 


recovering the m" stream can be found by solving 


S1.m 


£2.m “2s ™ 
Him Hom os Hy,,m| : = Ong (6.185) 


&N,,m 


withe ng 2S defined in (5.91). 


An interesting fact about MIMO equalizers is that, under certain circumstances, it is possible to 
find a perfect FIR matrix inverse. In other words, the least squares approach can have zero error. The 
exact conditions for finding a perfect inverse are described in [184, 367] and involve conditions on 
K, L, N,, and the coprimeness of certain polynomials. The intuition is that if NV, is large enough, then 


the block Toeplitz matrix in (6.189) will become square or even fat. In this case, assuming the matrix 
is full rank, either one (if square) or an infinite (if fat) number of solutions can exist. When more than 
one solution is available, common convention is to take the minimum norm solution, which gives the 
equalizer coefficients with the smallest norm and thus the least noise enhancement. These two 
solutions are explored in Example 6.15. 


Example 6.15 

Consider a frequency-selective slow-fading SIMO system where the transmitter has a single antenna 
and the receiver has two antennas. Assume that the channels from the transmit antenna to each receive 
antenna are frequency selective with Z + | taps. The received signals at the receive antennas are 
given by 


L 

yi[n] = So hi lé]s[n — 4] + vi [rn (6.186) 
&=0 
L 

yo[n] = >_ halé]s[n — ¢] + va{n}. (6.187) 


&=0 


Note that the receiver has two observations of the same signal. Each is convolved by a different 
frequency-selective channel. The receiver then applies a set of equalizers {91[k] }e=0 and 
{g2[] } ko such that 


K K 
5. gilklhi[n — k] + a go[klho[n — k] = d[n — ng]. (6.188) 


k=0 k=0 


k tye 
Formulate and solve the least squares equalizer problem given knowledge of {hai |] }¢—0 and 
{h2[é) boo. 


Answer: Based on (6.189), the coefficients of equalizer g, ; and g» ; are obtained by solving 


Hh Hp,1| ~ =p, (6.189) 


The solution depends on the rank and dimension of H. The dimensions are (L + K + 1)x(2(K + 1)). 
The matrix is square or fat if K > 2 — 1. The matrix 1s full rank in this case if the two channels are 
coprime, which means that they do not share any zeros in their Z transforms [184]. If K < L — 1, then 
the solution is the usual least squares equalizer estimate given by 


¢=(8°H)“"H'e;., (6.190) 


where g)) =[8]-1-x+1 and Qo; =[8]-«+2.2K+2. The optimum delay can be selected by finding the delay 
e* (I — H(H*H)"*H")e,, 


os 
tha 


that minimizes the residual error 


If K is chosen large enough such that K > L, then H is a fat matrix and there are an infinite number 
of possible solutions. It is common to choose from among those solutions the one that has the 
minimum norm |g\|, which is 


g = H* (HH")  en,. (6.191) 
* (*) 
The delay can be further optimized by finding the value of ng such that ©na ( ) nai 
minimized. 


Frequency-Selective Channel Estimation 
Channel estimation also generalizes to the MIMO case by forming a suitable least squares problem. 


Nis—1 bie 
Suppose that training sequence {tj [| }n=o is sent from antenna /. We focus on estimating the set of 


rN: 
channels {hi 514 jal from the observations at antenna 7. Writing the observed data as a function of 
the unknowns in matrix form following the approach in (5.173), let 


yi[L] 
Yi [L + 1] 
bf ; 
Yi [Mer 7 1] 


h,,;{0| 
hij = sa 
hi s(D 
v; [LD] 
v;[L + 1] 


Vi= 


vi| Nex — 1] 


Then the observation can be written as 


h; i 
hj.2 
yi=|Tr To Tw] |. 
———— : 
T hin, 
Nps 
h; 


The least squares channel estimate follows as 


h; = (T*T)“'T*y;. 


By. <a) 


(6.192) 


(6.193) 


(6.194) 


(6.195) 


(6.196) 


(6.197) 


Compared with the SISO case, the MIMO case estimates all the channels seen by receive antenna k at 
once using the fact that training sequences are sent simultaneously from each transmit antenna. 


Example 6.16 


Consider a frequency-selective slow-fading MISO system where the transmitter has two antennas and 
the receiver has a single antenna. Assume that the channels from each transmit antenna to the receive 
antenna are frequency selective with Z + 1 taps. The received signal is given by 


L 
yi[n| = » hy 3[4]si[n — 4] + hy2[€)sain — €) + v1 [n]. (6.198) 
f=) 


Ny r~— 1 p Nt _— 1 
The transmitter uses training sequences {tilM|]}n“o and {t2l”|}n“o . Solve for the least Squares 
channel estimate. 


Answer: The squared error can be written as follows: 


2 


Ner-1 L 

J(hy 1, hy.2) = S- yi|n| — \., hy1([@ti[n — €] + hy2[€ltel[n — (6.199) 
n=L f=0 

= |lyi — (Tibi + Tehi2)||. (6.200) 


saat a fa PG T 
Defining 1 = [by bi2) and f= [T1, T2] then the solution is 


h, = (T*T)"'T*y,. (6.201) 


Generalizations are possible for each of the impairments and the algorithms for dealing with them 
from Chapter 5 to MIMO frequency-selective channels. Direct equalizers can be formulated and 
solved using appropriately defined block matrices. Carrier frequency offset and frame 
synchronization are also important in MIMO systems. We defer discussion of MIMO-specific 
synchronization algorithms to the specific case of MIMO dealt with in Section 6.5. 


A block diagram of a MIMO communication system is provided in Figure 6.18. Many of the 
functional blocks serve the same purpose as in the SISO communication system but with added 
complexity because there are multiple inputs and multiple outputs. One of the functions with the 
highest complexity is the linear equalizer. Both estimating and applying the equalizer require a 
significant amount of computation. An alternative that has been widely successful in commercial 
systems is to use OFDM for its easy equalization properties. This is discussed in more detail in the 
next section. 
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Figure 6.18 A possible transmit block diagram for a MIMO system with spatial multiplexing 


6.5 MIMO-OFDM Transceiver Techniques 


MIMO-OFDM combines the spatial multiplexing and diversity features of MIMO communication 
with the ease of equalization when using OFDM modulation. Indeed, MIMO-OFDM is currently the 
de facto approach for MIMO communication. It is used in IEEE 802.11n [158] and IEEE 802.1lac 
[161]. A variation known as MIMO-OFDMA (orthogonal frequency-division multiple access) 1s used 
in WiMAX [157], 3GPP LTE [127], and 3GPP LTE Advanced. This section reviews MIMO-OFDM 
with an emphasis on its application with spatial multiplexing. After a review of the system model, the 
operation of different receiver functions is explained in more detail. Equalization and precoding are 
explained in MIMO-OFDM, where it is shown that they follow from the narrowband MIMO 
exposition. Different approaches for channel estimation are reviewed based on training data sent in 
the frequency domain. Least squares estimators are derived for the channel estimates and are found to 
have additional structure. Finally, a generalization of the Moose algorithm for carrier frequency offset 
synchronization and frame synchronization to MIMO-OFDM is described, assuming a single offset 
among all the transmit antennas. 


6.5.1 System Model 

Consider a MIMO-OFDM system with spatial multiplexing as illustrated in Figure 6.19 and Figure 
6.20. Denote the symbol stream (assumed to originate in the frequency domain) as s[7], the N, x 1 
vector symbol as s[7], and the corresponding subsymbols on antenna j as s;[”]. After the spatial 


multiplexing operation, each subsymbol stream is passed into a SISO-OFDM transmitter operation 
that consists of a 1:N serial-to-parallel operation followed by an N-IDFT and the addition of a cyclic 
prefix of length L,. Let w[n] denote the N, x 1 time-domain vector output of the cyclic prefix addition 


block and w,[] the samples to be sent on the j" transmit antenna. 


Spatial 
multiplex 


Figure 6.19 Block diagram of a MIMO-OFDM system transmitter. The spatial multiplexing 
operation happens prior to the usual OFDM transmitter operations. The output of each OFDM 
modulator is possibly pulse shaped, converted to continuous time, then upconverted. 


Figure 6.20 Block diagram of a MIMO-OFDM system receiver 


Consider the signal at the r" receive antenna assuming perfect synchronization: 
N; 


L 
yi[n] = >. by hy; |€)w;[n — €] + v,[n]. (6.202) 


j=1 €=0 


We proceed by following the steps in (5.161)—(5.167), exploiting the linearity of the DFT operation. 
Let 


L 
hi glk] = >_ hig[Qe7F (6.203) 


£=0 


denote the N-DFT of the (zero-padded) channel between the r“" receive antenna and the m" transmit 
antenna. Discarding the first L. samples and taking the V-DFT of the result gives 


Nt 
yiln] = S~ hij [k]Sm[k] + vi[n]. (6.204) 


pak 


Now define the matrix response [H[A]]; ; = h, |[A]. Equivalently, H[X] is related to H[7] through the 
N-DFT of the matrix channel: 
5 
H[k] = y; H[éje"9°8" (6.205) 


£=0 


Stacking the observations in (6.204) for i=1,2,..., N, gives the canonical MIMO-OFDM system 
equation 


y[k] = H[R|s[A] + vik], (6.206) 


Compared to the case where OFDM is not used in Section 6.4.7, (6.206) trades the multivariate 
convolution in (6.183) for a per-subcarrier matrix multiply. This leads to reductions in equalizer 
complexity and better performance in most cases. Compared with the special case of SISO-OFDM in 
(5.168), per-subcarrier equalization has higher complexity in the MIMO case. Furthermore, there are 
additional trade-offs to be made in terms of the choice of equalizer and detector and their resulting 
performance. 


The similarity of the MIMO-OFDM signal model in (6.206) and the flat-fading MIMO signal 
model in (6.112) is one of the primary reasons that so much emphasis in research is placed on flat- 
fading MIMO channel models. The main observation is that many flat-fading results (equalization, 
detection, and precoding) can be readily extended to MIMO-OFDM with a suitable change in 
notation. This underlies the relevance of the emphasis on flat-fading signal models in this chapter. 


6.5.2 Equalization and Detection 


The equalization options for MIMO-OFDM are similar to those of their flat-fading counterparts. A 
maximum likelihood detector would solve 


s|k] = arg min |ly[k] — H[A]s||? (6.207) 
seS 
fork =0,1,...,N—1. The brute-force search complexity is the same as in the flat-fading case. The 


main difference is that in the case of MIMO-OFDM, the candidate symbol vectors are multiplied by 
H[A], which is different for every subcarrier. This may make it harder to precompute all the possible 
distorted symbol vectors because of higher storage requirements. 


The zero-forcing detector operates in an analogous fashion as well. With G[k] = H[&]", the 
receiver first computes 


z|k] = G{kly[k), (6.208) 
then applies a separate detector for each stream to compute 


[nm] = arg min|z,[n] — ¢|?. (6.209) 
cEC 
Complexity can be reduced in a couple of different ways. First, instead of computing an inverse and 
then performing the product in (6.208), the solution can be found through a QR factorization [131]. 
Second, the relationship between adjacent inverses can be exploited to reduce the computational 
complexity need to find an inverse for every subcarrier [49]. 


Performance analysis of these detectors is more complicated ina MIMO-OFDM system as it 
depends on the channel model. Both delay spread and more generally the power delay profile impact 
the performance [46]. If each H[7] is IID with IID zero-mean Gaussian entries with the same 
variance, then H[x] are also IID with the same distribution. As a result, the maximum likelihood and 
zero-forcing detectors for spatial multiplexing achieve the diversity performance as predicted in 
Section 6.4.1. In theory, though, the total amount of diversity available is N,V.(L + 1); achieving this 


much diversity requires either coding and interleaving [6] or space-frequency codes [319]. 


6.5.3 Precoding 


Precoding generalizes in a natural way to MIMO-OFDM based on the relationship in (6.206). Let 
F[A] denote a precoder derived for channel H[x], and suppose now that s[k] is NV, x 1. Then the 


received signal with precoding is 
x[k] = H[k|F[k]s[k] + v[k] (6.210) 


fork =0,1,...,N—1. The receiver performs equalization and detection based on the combined 
channel H[A]F[x]. 

The concept of limited feedback can be used to convey quantized precoders from the receiver back 
to the transmitter using a shared codebook and applying quantization to each H[k]. The feedback 
overhead, however, grows with N since each subcarrier needs its own precoder. There are different 
approaches for avoiding this requirement. For example, a single precoder can be used for several 
adjacent subcarriers, exploiting the coherence bandwidth of the channel [229]. Alternatively, every 
K" precoder can be fed back and then interpolation used at the transmitter to recover the missing ones 
[68, 67, 251]. Another approach is to quantize the lower-dimensional temporal channel matrix [307], 
but this requires different quantization strategies. Despite the potential overheads associated with 
precoding, it is widely deployed with MIMO-OFDM in IEEE 802.11n, IEEE 802.1lac, WiMAX, 
3GPP LTE, and 3GPP LTE Advanced. 


6.5.4 Channel Estimation 


One of the critical receiver operations ina MIMO-OFDM system is channel estimation. This section 
reviews different approaches for estimating the channel based on known symbols inserted into the 
transmitted sequence. These strategies use either the time-domain received waveform y[7] or the 
oO 5 deere NUL 

frequency-domain received waveform y[] to estimate {H|k]} k=0 or LH|4| }70 and then determine 

= VUL 
{H[k]} k=0 through application of the N-DFT to each {i,j |!) $20. More information on channel 
estimation for MIMO-OFDM is found in the literature [198]. 


For simplicity of exposition, we focus on training data inserted into a single OFDM symbol. 


[, N-=-1 
Multiple symbols may be exploited by augmenting the appropriate matrices. Suppose that {t{n]}n=o 
is a known training sequence and that s[”] = t[n] forn =0, 1,..., N— 1. In this case N,, = N. The 


; — : Gee {w[n]}NtL-1 ‘al N+L~1 
time-domain signal after the cyclic prefix addition is n=0 . Treating n=0 as 


known information, the approaches in Section 6.4.7 can be exploited at the receiver to estimate the 


w|n) } 
JU N+L-1 ; ; 
channel based on {¥|"/}n=0 . The block size of N+ L in the MIMO-OFDM case is the only 
N+L-1 
significant difference. The resulting T; matrix (constructed in this case from {w[n]}, a is Nx (L 


+ 1) and y,; is N x 1. The time-domain least squares channel estimate follows from (6.197). Perhaps 


the only other notable difference in the least squares formulation in this case is that the performance 
of the estimator depends on the matrix constructed from the DFT of the training sequences. Having 


: Bee nV! . 
good correlation properties in {t{r]},<0 does not necessarily translate into good correlation 


N+L-1 
properties in the resulting {w[n]}n=0 Consequently, other specially designed sequences may be 
of interest [26, 126]. 


Now we consider algorithms that directly exploit the training sequence in the frequency domain. 
The received signal is 


y[k] = H[k]t[k] + vk] k= 0,1,...,N—1. (6.211) 


It is not possible to estimate H[x] from only t[A]; estimating H[A] directly requires training in 
multiple OFDM symbol periods. Here we focus on only one OFDM symbol; therefore, we consider 
es 
methods that estimate {H|/| }¢~o instead. This is motivated by the usual choice of N -s. L, which 
means there are N frequency-domain channel coefficients but only L + 1 time-domain coefficients. 
Writing (6.211) as a function of the time-domain channel requires a bit of care. First, we rewrite 
H[A] as a function of H[€] in matrix form: 


L 
H[k] = }- [ge (6.212) 
£=0 
Iy, 
é iL y, 
- (0 Hi) os H{L]| | | (6.213) 
eS * Ny. 
Now define the vector 
elk]? = 1 e~ 5H -j a (6.214) 


Rewrite the channel using the Kronecker product 


H/{k] = (0) H[i] --- H{L]| (e[k] @ In, ) (6.215) 
and compute 
vec(H/{0}) 
—_ 7 vec(H/1)) —— 
vec(H[k]) = ((e[k]" @In,) ® In, ) (6.216) 
vec(H|Z]) 
h 


Now we use Kronecker product identities to rewrite (6.211) as 


y[k] = vec(y[h}) ( 
= vec (H[k]t[k]) + vk] ( 
= (t{k]’ @ Iy.) vec(H[k]) + v[k] (6.219 
= (t[k]” @Iy,) ((e[k]? @Iy,) @Iy,) ht vik] (€ 
( 


= (e[k]? @t[k]? @Iy.) h + v[k). 6.221) 
eee 
T[k] 


The known values of the training can be used to build a least squares estimator. Suppose that pilot 
subcarriers are used. This means that training is known only at subcarriers K = {ky, ko, ..., Ay}. 
Training over the entire OFDM symbol K = {0, 1,..., N—1} is a special case. Stacking the 
observations in (6.221) gives 


y [ki] T[ky| viky) 
y [ky T [ko] vik) | 
a | Ol Patel (6.222) 
vied} [Tied] [vik 
* ul ty 


The dimensions of T are |A|N,. x V.N.(Z + 1). With enough pilot subcarriers, |K| can be made large 
enough to ensure that T is square or tall. Assuming a good training sequence design so that T is full 


rank, the least squares estimate is then computed in the usual way as h= (ery T*y, The final 
wanes .JYN=1 
step would be to reform {H\/] }¢~0 from h and then to take the N-DFT to find {H[]} io . While the 


complexity of the least squares solution seems high, keep in mind that i tas i a can be 
precomputed so only a matrix multiplication is required to generate the estimate. A final takeaway 
from this derivation is that the least squares estimator in Chapter 5 is powerful and can be applied to 
a variety of sophisticated settings in the context of MIMO-OFDM. 


There are many generalizations and extensions of least squares channel estimators for MIMO- 
OFDM that result in better performance and/or lower complexity. A general approach for channel 
estimates is described in [198], including using multiple OFDM symbols and simplified estimators. 
Achieving good performance requires choosing good training sequences on each antenna. Training 
sequence optimality and designs are proposed [198, 26, 126]. It has been found, for example, that 
rotated versions of Frank-Zadoff-Chu sequences [119, 70] have good performance because they 
satisfy certain optimality criteria and are constant modulus in both time and frequency domains. 


6.5.5 Carrier Frequency Synchronization 


Synchronization is an important operation for MIMO-OFDM communication systems. This section 

reviews MIMO-OFDM carrier frequency offset estimation, also providing some guidance on frame 
synchronization. The main approach is to generalize the Moose and Schmidl-Cox approaches from 

Chapter 5 that leverage periodicity in the transmitted signal. 


The extent of synchronization required ina MIMO-OFDM system depends on whether the transmit 
antennas and receive antennas are each (respectively) locally synchronized. Consider a system where 
each RF chain independently generates an imperfect version of f,. The transmit signal generated by 

(tx) 
the j'" transmit antenna is created by upconversion using a carrier frequency : fe J , whereas the 


(rx) 
downconverted signal at the i" receive antenna is demodulated using carrier frequency Fei . Let the 


-_ ( rx ) (tx ) 
normalized frequency offset be ©.) ~ (Fo, =i )T . Using the same logic as in Section 5.4.1, 
(6.202) with carrier frequency offset becomes 
Ny 


yiln] = Deere 2” gs ldugin — 4 0 [in]. (6.223) 


j=1 £=0 


Example 6.17 


Simplify the received signal equation in (6.223) for the case of flat fading with single-carrier 
modulation, and write it in matrix form. Explain how to correct for carrier frequency offset. 


Answer: Simplifying the case where L = 0 and assuming single-carrier modulation, 


yi (| -y ARIMA: os, + v,(n]. (6.224) 
Stacking this results in 
hy, e)27«11" hygel27e127" = hin, el2re1n,n 
site hg, e)27«21" hggel?™22" han, el2762Ne" teil evel. (6.225) 
h N,. eit Nrin fy wy gel?" Mra ae if N,N ei2me Nr Ny 
H[n] 


If estimates of the channel H and all the frequency offsets Léi, iis J=l are available, then an 
effective time-varying matrix H[n] can be constructed and used for joint carrier frequency offset 
correction and equalization. It should be clear that this approach is more sensitive to offset error, 
since even small errors can create big matrix inverse fluctuations. 


Even if the offsets {¢;;} can be estimated, they cannot be easily removed with a multiplication by e 


“2€:;" because the offsets show up inside the summation in (6.223). The problem is further enhanced 
by the observation that the offsets are different for eachi=1,2,..., N., making it challenging to 


apply any joint correction to y[”]. The problem is simplified somewhat if a common reference is used 
at the receiver, using joint equalization and carrier frequency offset synchronization [324] or more 
sophisticated estimation methods [181], but the resulting receiver complexity is still high. For this 
reason, it is important that a common reference be used to generate the carriers at both the transmitter 


and the receiver. 

Ifa common reference is used at each transmitter and receiver, then ¢; pe for all 7 and 7, and there 
is only a single offset. Then the offset can be pulled out of the sum in (6.223) since it no longer 
depends on. Since the offset does not depend on i, a matrix equation can be written as 

L 
y|n] = een S~ Hi{éw{[n — é)+ v[nl. (6.226) 


é=0 


Given an estimate of the offset €, correction of the offset in (6.226) amounts to generating the signal 


exp(—j27en)y|n| The remaining challenge is then to develop good algorithms for estimating the 
offset that work with multiple transmit and multiple receive antennas. 


The simplest generalization of the Schmidl-Cox approach is to send the same training sequence 
simultaneously from every antenna. If this is the case, then w[n] = 1w[n] where 1 is an AN, x 1 vector 
of all 1s and the receive signal in (6.226) simplifies as 


L 
y[n] = e!?7en >, H[é]1 w[n — €] + v[n) (6.227) 
é=0 Ea 
so that each received signal has the form 
L ~ 
y;[n] = e!?@™ ss. h,[@\wi[n — €) + v;[n]. (6.228) 


é=0 
Assuming for simplicity of exposition that the odd subcarriers are zeroed such that 


w[n+ D,] = w[nt+2L.+ N/2], (6.229) 


then with ¥il"] = yilr + Le (in the absence of noise) 


y;[n + N/2] = e2reN/25, [n] (6.230) 


forn=0,1,...,MW/2—l1andi=1,2,...,N,. Asimple estimator using least squares follows as 


N/2- 1 Ny —* f AT/9]s. 
e= p Serle yy Yi i" + N/ 2| 9; [7 ; (6.231 ) 
TIN 
The integer offset can be estimated in a similar manner with a second training sequence. While this 
approach is simple, sending the same training sequence on all the antennas means that the approaches 
employed in Section 6.5.4 cannot be applied to estimate the channel. 


An easy generalization of the periodic training structure is to send training data separately on the 
transmit antennas orthogonally in time. For example, one periodic OFDM symbol could be sent 
successively from each transmit antenna. Assuming the training 1s sent successively from each 
transmit antenna /, then 


y(n + (7 —1)(N + ZL) 


L 
= eltrentG-DINFL Shy [win — + vin +(7-1(N+Le))- (6.232) 


£=0 


Using the Yi "| notation as before (in the absence of noise), 


gi[n + N/2+(j —1)(N + L,)| = &?7/2G,[n + (fj —1)(N + Le) (6.233) 


forn=0,1,...,MW2-1,i=1,2,...,N,,and7=1,2,..., N,. Again, a least squares estimator can 
be formulated as 


oe | ds Dyk Bln +. N/2+ (9 — (N+ Le)}giln + (9 — IN + Le)] 


TN 
(6.234) 


Compared to (6.231), the approach in (6.234) offers improved performance through additional 
averaging over time. Note that the indexing would change slightly if a second OFDM symbol were 
sent from each antenna for the purpose of performing the integer offset, but the idea remains the same. 


The carrier frequency offset estimator can be further generalized by allowing simultaneous 
transmission of training sequences from each transmit antenna. This ensures that each antenna is 
operating at its peak power (there are per-antenna power constraints in practical MIMO systems) but 
adds further complexity in the resulting estimators. Joint design of training sequences for channel 
Discussions of system implementations with carrier frequency offset estimation are found in [318, 
342, 47]. 

While we did not consider frame synchronization explicitly, the approach for carrier frequency 
offset estimation can be leveraged to perform frame synchronization as well. Performance can 
improve over the SISO case because noise is averaged across different receive antennas, leading to a 
better estimate. 


6.6 Summary 


¢ SIMO communication systems use a single transmit antenna and multiple receive antennas. 
There are different ways to combine the outputs of each antenna, including antenna selection 
and maximum ratio combining. These approaches achieve second-order diversity in IID 
Rayleigh channels. Most receiver processing algorithms for SISO systems extend without major 
changes to SIMO. 


¢ MISO communication systems use multiple transmit antennas and a single receive antenna. 
Taking advantage of the transmit antennas is more difficult than in the SIMO case. To achieve 
the most benefits, either space-time coding or transmit beamforming is required. Space-time 
coding spreads signals across the antennas in a special way but usually requires higher- 
complexity decoding. Transmit beamforming can be implemented efficiently using limited 
feedback, where the best beamforming vector is selected from a codebook known to both the 
transmitter and the receiver. Transmit beamforming can also be implemented using channel 


reciprocity, where the receive and transmit channels are reciprocal because of the use of 
frequency-division duplexing. 

¢ MIMO communication systems use multiple transmit and receive antennas. Spatial multiplexing 
is the most common transmission technique in MIMO systems. 


¢ Precoding is the generalization of transmit beamforming to the MIMO setting. The optimum 
precoder comes from the dominant right singular vectors of the channel. The precoder can also 
be designed using limited feedback from a precoder codebook known to both the transmitter 
and the receiver. Antenna subset selection is a special case of limited feedback where the 
codebook consists of different subsets of transmit antennas. 


¢ Channel estimation in MIMO systems requires sending different training sequences on each 
transmit antenna. When least squares is performed, the channels to each receive antenna can be 
estimated separately. 


¢ Equalization is more challenging in MIMO systems. Linear equalization in the time domain 
would require implementing a multivariate equalizer, which usually has high complexity. As a 
result, MIMO-OFDM is common. With MIMO-OFDM, equalization and detection strategies for 
flat-fading MIMO systems can be used per subcarrier. 


¢ Carrier frequency offset is a more significant problem in MIMO systems compared with SISO 
systems. Locking the transmit carriers and receive carriers simplifies the problem. 
Synchronization techniques like the Schmidl-Cox approach and the Moose algorithm extend to 
the MIMO setting with some modifications. 


Problems 

1. Look up cyclic delay diversity as used in IEEE 802.11n and explain how it works. 

2. Consider a flat Rayleigh fading MISO system that employs delay diversity with NV, = 2 N,= 1 to 
send data {$|] at. 
(a) Supposing that s[—-1] = 0 and s[N] = 0, form the 2 x 101 space-time codeword matrix S. The 

length is 101 so that symbol s[99] sees both transmit antennas. 

(b) Show that this space-time code has full diversity. 

3. Consider a SIMO system with flat fading described by 


y(n] = hs{n] + vin]. (6.235) 
Suppose that the v[7] is zero mean with covariance R,. Determine the receive beamforming 
vector w that maximizes 


|w*h|? 


[|w*v[n]|?- 


SNRin = F (6.236) 


You may assume that R,, is invertible. 


4. Consider MMSE receiver beamforming in a SIMO system with flat fading described by 
y(n] = hs{n] + vin]. (6.237) 


Suppose that v[] is zero mean with covariance R,, s[m] is a zero-mean unit variance IID 


sequence, and s[n] and v[n] are independent. Determine the receive beamforming vector w that 
minimizes the MMSE defined as 


E [|w*y[n] — s[n]|?] . (6.238) 


Show how the result is simplified in the case where R, = I, and explain the differences with 
MRC. 


5. Consider a SIMO system with a single-path channel with 0 = 1/4 and a = 0.25e"3, 
(a) Determine the MRC beamforming solution. 


(b) Determine an expression for |w*h,?. 


(c) Generate a polar plot of |a(d)*h,/. Interpret your results. 


6. Three well-known diversity combining methods are maximum ratio combining (MRC), 
selection combining (SC), and switch-and-stay combining (SSC). Compare and contrast these 
techniques. In a flat-fading channel, what is the impact of each method on the symbol error rate; 
that 1s, how is the probability of symbol error different from the symbol error rate for a SISO 
channel? Note: You may need to consult additional references to solve this problem. 


7. Consider the groundbreaking space-time code devised by Alamouti, used ina MIMO system 
with N, = 2 and N, receive antennas. Derive the pairwise error probability for the Alamouti 


code and show that the diversity order is 2N.. 


8. Derive the expression for the pairwise error probability of space-time coding with N, receive 
antennas. Hint: Write (6.92) for an N, < N, matrix H instead of a 1x, matrix h*, then use 
properties of the Kronecker product. 

9. Consider 4-QAM transmission. Using Monte Carlo simulations, plot the probability of symbol 
error for the following transmission strategies up to error rates of 107°: 
¢ Gaussian channel 
¢ Rayleigh fading channel 


¢ SIMO Rayleigh fading channel with antenna selection (select the antenna with the best 
channel) with two antennas 


¢ SIMO Rayleigh fading channel with optimum combining with two, three, and four antennas 
¢ MISO Rayleigh fading channel with the Alamouti code 

Hint: To perform this simulation, you only need to simulate the discrete-time system. For 
example, for the Gaussian channel you could simulate 


y(n] = VE, s{n + y[n]. (6.239) 
You should generate approximately V = 100 =< 1/P, symbols where P, is the minimum target 
error probability. Then scale the symbols and add to each symbol an independently generated 
noise relation. Perform detection and count the errors. The average number of errors divided by 
Nis your estimate of the probability of symbol error. 


Based on your results, answer the following questions: 


(a) Determine the small-scale fade margin for each technique at a symbol error rate of 10°. 
(b) What is the benefit of optimum combining versus antenna selection? 
(c) What is the benefit of antenna selection versus the Alamouti code? 


(d) What is the impact of increasing the number of receive antennas? How does the increased 
diversity benefit the system? 


10. Consider the Alamouti code with two transmit and two receive antennas. Derive the optimum 
combiner for the case where two receive antennas are used. Essentially, write the equations for 
the received signal on each antenna. Stack them together and perform the spatial matched filter. 
Then show how the observations on each antenna can be combined to get better performance. 
Explain the diversity order. 


11. Derive the maximum likelihood decoder for a spatial multiplexing system with N, = 2 and 
where different constellations C, and C, are used. 

12. Computer Develop a flat-fading MIMO channel simulator. Your simulator should generate a 
flat-fading matrix channel of dimensions WN, x N, for the following channel models: 
(a) IID Rayleigh fading 
(b) Spatially correlated Rayleigh fading with input covariances R,, and R,., 


(c) Rank-one LOS channel where H = aa(@)a*(o) where a is Ne(0,1) and a(-) is the uniform 
linear array response vector 


(d) Clustered channel model in (6.114) with inputs C, R, and Laplacian power azimuth 
spectrum in (6.13) 

13. Computer Using Monte Carlo simulations, plot the probability of symbol error for a 4-QAM 
SIMO flat-fading channel with given distribution down to a probability of symbol error of 107 
for N, = 1, 2, 4, 8. Compute the following and interpret your results. What is the effect of 
correlation? How does it behave for larger numbers of antennas? 

(a) IID Rayleigh fading 
(b) Spatially correlated Rayleigh fading using the exponential correlation model with p = 
0.5ei"/3 

14. Computer Using Monte Carlo simulations, plot the probability of symbol error for a 4-QAM 
MISO flat-fading channel down to a probability of symbol error of 10> for N, = 2 for the 
following signaling strategies. Assume IID Rayleigh fading. Interpret your results. 

(a) Maximum ratio transmission 

(b) Equal gain transmission (take the phase of the MRT solution) 

(c) Transmit antenna selection (choose the best transmit antenna and put all your power there) 
(d) Limited feedback transmit beamforming with the LTE codebook in (6.69) 

(ec) Alamouti code 


15. Computer In this problem, you compare the performance of different MIMO configurations 
and performance with two transmit antennas and two receive antennas. To make comparisons 


fair, we keep the rate (size of the vector constellation) fixed within each strategy. Using Monte 
Carlo simulations, plot the probability of symbol error for the following transmission strategies 


up to error rates of 107° in an IID Rayleigh fading channel: 
¢ Alamouti code with 16-QAM 
¢ Spatial multiplexing with 4-QAM for each data stream with a zero-forcing receiver 


¢ Spatial multiplexing with 4-QAM for each data stream with a maximum likelihood 
receiver 


Hint: To perform this simulation, you only need to simulate the discrete-time system. For 
example, for spatial multiplexing, you need to simulate 


y|n| = Hs[n] + vin] (6.240) 


where H has IID Rayleigh fading and path loss is neglected. You should generate about 100 
noise realizations and for each noise realization 100 channel realizations for each simulation 
point. This means 100 < 100 simulations per SNR point. You might get by with less to get a 
smooth curve. Also, just as a reminder: 


¢ Spatial multiplexing with a zero-forcing receiver involves computing (H) 'y[n] and then 
detecting each substream separately. Note that you need to compute the vector error; that 1s, 
an error is counted if either or both of the symbols sent at the same time differ from what 


was transmitted. In other words, there is an error if s[n| # Slr u 
¢ Spatial multiplexing with a maximum likelihood receiver involves computing arg 


. / De 12 
Minges | —,/—*Hs\l- 

uinses lly[n] VN s| where Sis the set of all possible vector symbols, for 
example, all possible symbols sent from antenna | and all possible symbols sent from 


antenna 2. In this case it has 16 entries. Again, compute the vector error rate. 


Because we compare 16-QAM transmission and spatial multiplexing with 4-QAM 
transmission, the rate comparisons are fair. 


(a) For what SNRs is the Alamouti technique preferred to spatial multiplexing? 


(b) What is the performance difference between zero forcing and the maximum likelihood 
receiver? 


(c) What is the complexity difference between zero forcing and the maximum likelihood 
receiver? 
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